PROVISIONAL SPECIFICATION

Invention Title: 

Porphyromonas gingivalis antigens and probes 
The invention is described in the following statement: 



FIELD OF THE INVENTION 

The present invention relates to P. gingivalis nucleotide sequences, P. 
gingivalis polypeptides and probes for detection of P. gingivalis. 

BACKGROUND OF THE INVENTION 

Periodontal diseases are bacterial-associated inflammatory diseases of 
the supporting tissues of the teeth and range from the relatively mild form of 
gingivitis, the non-specific, reversible inflammation of gingival tissue to the 
more aggressive forms of periodontitis which are characterised by the 
destruction of the tooth's supporting structures. Periodontitis is associated 
with a subgingival infection of a consortium of specific Gram-negative 
bacteria that leads to the destruction of the periodontium and is a major 
public health problem. One bacterium that has attracted considerable 
interest is P. gingivalis as the recovery of this microorganism from adult 
periodontitis lesions can be up to 50% of the subgingival anaerobically 
cultivable flora, whereas P. gingivalis is rarely recovered, and then in low 
numbers, from healthy sites. A proportional increase in the level of 
P. gingivalis in subgingival plaque has been associated with an increased 
severity of periodontitis and eradication of the microorganism from the 
cultivable subgingival microbial population is accompanied by resolution of 
the disease. The progression of periodontitis lesions in non-human primates 
has been demonstrated with the subgingival implantation of P. gingivalis. 
These findings in both animals and humans suggest a major role for P. 
gingivalis in the development of adult periodontitis. 

P. gingivalis is a black-pigmented, anaerobic, asaccharolytic, 
proteolytic Gram-negative rod that obtains energy from the metabolism of 
specific amino acids. The microorganism has an absolute growth 
requirement for iron, preferentially in the form of haem or its Fe(III)
oxidation product haemin, and when grown under conditions of excess
haemin is highly virulent in experimental animals. A number of virulence
factors have been implicated in the pathogenicity of P. gingivalis including 
the capsule, adhesins, cytotoxins and extracellular hydrolytic enzymes. 



In order to develop an efficacious and safe vaccine to prevent 
P. gingivalis colonisation it is necessary to identify and produce antigens that 
are involved in virulence that have utility as immunogens to generate 
neutralising antibodies. Whilst it is possible to attempt to isolate antigens
directly from cultures of P. gingivalis, this is often difficult. For example, as
mentioned above, P. gingivalis is a strict anaerobe and can be difficult to
isolate and grow. It is also known that, for a number of organisms, many
virulence genes are down-regulated when the organism is cultured in vitro,
and the encoded proteins are no longer expressed. If conventional chemistry
techniques were applied to purify vaccine candidates, potentially important
(protective) molecules may not be identified. With DNA sequencing, as the
gene is present (but not transcribed) even when the organism is grown in
vitro, it can be identified, cloned and produced as a recombinant
protein. Similarly, a protective antigen or therapeutic target may be
transiently expressed by the organism in vitro or produced at low levels,
making the identification of these molecules extremely difficult by
conventional methods.

With serological identification of therapeutic targets one is limited to 
those responses which are detectable using standard methods such as 
Western Blotting or ELISA. The limitation here is both the level of
response that is generated by the animal or human and determining whether 
this response is protective, damaging or irrelevant. No such limitation is 
present with a sequencing approach to the identification of potential 
therapeutic or prophylactic targets. 

It is also well known that P. gingivalis produces a range of broadly 
active proteases (University of Melbourne International Patent Application 
No PCT/AU 96/00673, US Patent Nos 5,475,097 and 5,523,390), which make 
the identification of intact proteins difficult because of their degradation by 
these proteases. 

SUMMARY OF THE INVENTION 

The present inventors have attempted to isolate P. gingivalis 
nucleotide sequences which can be used for recombinant production of 
P. gingivalis polypeptides and to develop nucleotide probes specific for 
P. gingivalis. The DNA sequences listed below have been selected from a 
large number of P. gingivalis sequences according to their indicative
potential as vaccine candidates. This intuitive step involved comparison of 
the deduced protein sequence from the P. gingivalis DNA sequences to the 
known protein sequence databases. Some of the characteristics used to 
select useful vaccine candidates include: the expected cellular location, such
as outer membrane proteins or secreted proteins, particular functional 
activities of similar proteins such as those with an enzymatic or proteolytic 
activity, proteins involved in essential metabolic pathways that when 
inactivated or blocked may be deleterious or lethal to the organism, proteins 
that might be expected to play a role in the pathogenesis of the organism 
e.g. red cell lysis, cell agglutination or cell receptors, and proteins which are
paralogues to proteins with proven vaccine efficacy. DNA sequences that 
were considered to be poor vaccine candidates and not selected include 
those that code for proteins involved in replication, non-essential proteins 
involved in cellular processes and those proteins present at sites that would 
be unlikely to be affected by immune mediators such as those found in the 
bacterial cytoplasm or inner membranes. 

In a first aspect the present invention consists in an isolated 
P. gingivalis nucleotide sequence, the nucleotide sequence consisting of or 
including a sequence selected from the group consisting of SEQ ID NO: 1, 
SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, 
SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9 and sequences complementary
thereto. 

In a second aspect the present invention consists in an isolated 
P. gingivalis polypeptide, the polypeptide being encoded by a nucleotide 
consisting of or including a sequence selected from the group consisting of 
SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8 and SEQ ID NO: 9. 

In a third aspect the present invention consists in a nucleotide probe 
specific for P. gingivalis, the probe including a detectable label and a 
nucleotide sequence of at least 15 nucleotides, the nucleotide sequence 
being derived from a sequence selected from the group consisting of SEQ ID 
NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID 
NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, and sequences 
complementary thereto.

In a fourth aspect the present invention consists in an isolated
P. gingivalis polypeptide, the polypeptide consisting of or including a 
sequence selected from the group consisting of SEQ ID NO: 10, SEQ ID 
NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, 
SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID
NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, 
SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27. 

In a fifth aspect the present invention consists in an isolated 
polynucleotide, the polynucleotide encoding a polypeptide consisting of or
including a sequence selected from the group consisting of SEQ ID NO: 10,
SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19,
SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID
NO: 24, SEQ ID NO: 25, SEQ ID NO: 26, SEQ ID NO: 27.

In a sixth aspect the present invention consists in a composition for
use in raising an immune response in an animal directed against P. gingivalis,
the composition including an acceptable carrier and/or adjuvant and at least
one polypeptide, the at least one polypeptide having a sequence selected
from the group consisting of SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12,
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO:
17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID
NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25, SEQ ID NO: 26,
SEQ ID NO: 27.
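
By way of illustration only, the probe of the third aspect (a nucleotide sequence of at least 15 nucleotides derived from one of SEQ ID NO: 1 to 9 or from a complementary sequence) might be enumerated computationally as sketched below. The function names `reverse_complement` and `candidate_probes` are hypothetical and form no part of the specification, and the sketch does not assess whether any candidate is specific for P. gingivalis; that would require comparison against other organisms' sequences.

```python
# Hypothetical helper names; a sketch only, not the inventors' procedure.
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(dna):
    """Return the complementary strand, read 5' to 3'."""
    return dna.lower().translate(COMPLEMENT)[::-1]


def candidate_probes(dna, length=15):
    """Yield every contiguous window of `length` nucleotides from `dna`."""
    if length < 15:
        raise ValueError("the third aspect calls for at least 15 nucleotides")
    dna = dna.lower()
    for start in range(len(dna) - length + 1):
        yield dna[start:start + length]


# First 20 nucleotides of SEQ ID NO: 1, used purely as example input.
SEQ1_FIRST_20 = "gaactaagcaagtgttacat"

forward = list(candidate_probes(SEQ1_FIRST_20))
reverse = list(candidate_probes(reverse_complement(SEQ1_FIRST_20)))
print(len(forward), forward[0])
print(len(reverse), reverse[0])
```

Candidates drawn from the reverse complement correspond to the "sequences complementary thereto" recited in the first and third aspects.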



Throughout this specification, unless the context requires otherwise, 
the word "comprise", or variations such as "comprises" or "comprising", will 
be understood to imply the inclusion of a stated element or integer or group 
of elements or integers but not the exclusion of any other element or integer 
or group of elements or integers. 

DETAILED DESCRIPTION 

Preparation of the P. gingivalis library for sequencing. 

To determine the DNA sequence of P. gingivalis, genomic DNA was isolated
from P. gingivalis strain W50 (ATCC 53978) essentially by the method
described by Marmur J. (1961)(1). Cloning of DNA fragments was performed
essentially as described by Fleischmann et al., (1995)(2). Briefly, purified 
genomic DNA from P. gingivalis was nebulized to fragment the DNA and 
was treated with Bal31 nuclease to create blunt ends then run twice on 
preparative 1% agarose gels. DNA fragments of 1.6-2.0 kb were excised from 
the gel and the DNA recovered. This DNA was then ligated to the vector 
pUC18 (SmaI digested and dephosphorylated; Pharmacia) and
electrophoresed on a 1% agarose preparative gel. The fragment comprising
linear vector plus one insert was excised and purified, and this process was
repeated to reduce contamination by vector without insert. The recovered vector
plus insert DNA was blunt-ended with T4 DNA polymerase, then a final 
ligation to produce circular DNA was performed. Aliquots of Epicurian Coli 
Electroporation-Competent Cells (Stratagene) were transformed with the 
library DNA and plated out on SOB agar antibiotic diffusion plates 
containing X-gal and incubated at 37°C overnight. Colonies with inserts 
appeared white and those without inserts (vector alone) appeared blue. 
Plates were stored at 4°C until the white clones were picked and expanded 
for the extraction of plasmid DNA for sequencing. 

DNA sequencing 

Plasmid DNA was prepared by picking bacterial colonies into 1.5 ml of
LB, TB or SOB broth supplemented with 50-100 µg/ml ampicillin in 96 deep
well plates. Plasmid DNA was isolated using the QIAprep Spin or QIAprep
96 Turbo miniprep kits (QIAGEN GmbH, Germany). DNA was eluted into a
96 well gridded array and stored at -20°C.

Sequencing reactions were performed using ABI PRISM Dye 
Terminator and ABI PRISM BigDye Terminator Cycle Sequencing Ready
Reaction kits with AmpliTaq DNA polymerase FS (PE Applied Biosystems, 
Foster City, CA) using the M13 Universal forward and reverse sequencing 
primers. Sequence reactions were conducted on either a Perkin-Elmer 
GeneAmp 9700 (PE Applied Biosystems) or Hybaid PCR Express (Hybaid, 
UK) thermal cyclers. Sequencing reactions were analysed on ABI PRISM 
377 DNA sequencers (PE Applied Biosystems). 
The sequences obtained are set out below. 

SeqID#1-9 represent the DNA sequences that encode the proteins in
SeqID#10-27. Proteins in SeqID#10-18 are the entire open reading frames
from DNA SeqID#1-9. Proteins in SeqID#19-27 are the proteins encoded by
DNA SeqID#1-9 from their putative initiation codon. The initiation codon
was calculated from sequence homology alignment using FastX or by the
ORF prediction program GeneMark.
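
The relationship between a DNA sequence and its encoded protein can be sketched as follows, assuming the standard genetic code. The example translates the first 60 nucleotides of SEQ ID NO: 1, which correspond to the first 20 residues of SEQ ID NO: 10. The function name `translate` and the `start` offset are illustrative assumptions; the offset of 18 simply selects the first atg in this frame and is not asserted to be the putative initiation codon used for SeqID#19-27.

```python
# Standard genetic code, with codons ordered t, c, a, g at each position.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, start=0):
    """Translate `dna` codon by codon from `start`; '*' marks a stop codon."""
    dna = dna.lower()
    residues = []
    for pos in range(start, len(dna) - 2, 3):
        residues.append(CODON_TABLE[dna[pos:pos + 3]])
    return "".join(residues)


# First 60 nucleotides of SEQ ID NO: 1 as given in the sequence listing.
SEQ1_FIRST_60 = "gaactaagcaagtgttacatggataaagtgagctatgctctgggattgagcatcggtaat"

# Whole frame from the first nucleotide (as for SeqID#10-18).
print(translate(SEQ1_FIRST_60))            # ELSKCYMDKVSYALGLSIGN
# Same frame, starting at the first atg (offset chosen for illustration only).
print(translate(SEQ1_FIRST_60, start=18))  # MDKVSYALGLSIGN
```

The one-letter output ELSKCYMDKVSYALGLSIGN is Glu Leu Ser Lys Cys Tyr Met Asp Lys Val Ser Tyr Ala Leu Gly Leu Ser Ile Gly Asn in three-letter code.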
As will be understood by those skilled in the art, open reading frames
(ORFs) may be readily identified. ORFs may be determined using two
methods. For example, alignments from FastX search results may be used to
define the start and end positions of coding regions if sufficient protein
homology is present. Alternatively, protein coding regions may be identified
using the ORF identification program GeneMark (3) using a training matrix
based on published P. gingivalis sequences. This matrix may be further
refined by adding ORFs identified from the results of homology searching
and ORFs identified by GeneMark. The program PSORT (4) may be used for
the detection of signal sequences and the prediction of protein cell
localisation. A UNIX version of TopPred (5) may be used to identify
potential membrane spanning domains.
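
For orientation only, the simplest form of ORF detection is a scan of each forward reading frame for an atg followed in frame by a stop codon, as sketched below. This is not the GeneMark method, which relies on a statistical model trained on known coding sequences, nor the FastX homology approach; the function name `find_orfs` and the `min_codons` threshold are hypothetical, and the reverse strand is omitted for brevity.

```python
STOP_CODONS = {"taa", "tag", "tga"}


def find_orfs(dna, min_codons=3):
    """Return (frame, start, end) for each atg-to-stop ORF on the forward strand.

    `start` is the index of the atg and `end` is one past the stop codon;
    `min_codons` counts the codons from the atg up to (not including) the stop.
    """
    dna = dna.lower()
    orfs = []
    for frame in range(3):
        start = None
        for pos in range(frame, len(dna) - 2, 3):
            codon = dna[pos:pos + 3]
            if start is None and codon == "atg":
                start = pos
            elif start is not None and codon in STOP_CODONS:
                if (pos - start) // 3 >= min_codons:
                    orfs.append((frame, start, pos + 3))
                start = None
    return orfs


# Toy input, not taken from the sequence listing.
toy = "ccatgaaagggtaacc"
for frame, start, end in find_orfs(toy):
    print(frame, start, end, toy[start:end])
```

A scan of this kind only proposes candidates; deciding which atg is the true initiation codon is the step for which the homology alignments or GeneMark are used.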

DNA sequence analysis 

Raw trace data files from the ABI 377 sequencer were manually
trimmed using Staden Pregap (Laboratory of Molecular Biology, Medical
Research Council, UK) running on a Sun Microsystems computer. Trimmed
files were assembled into contigs using Staden Gap v4.1 and exported as a
consensus file in FastA format. This consensus was converted into GCG
format files and analysed for homology using the FASTX algorithm on a
non-redundant protein database compiled by ANGIS (Australian Genomic
Information Service, University of Sydney). Individual FASTX search
results were examined for significant homology by statistical probability and
amino acid alignments.
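
As a minimal sketch of the FastA export step, a consensus sequence can be written as a header line beginning with ">" followed by the sequence wrapped at a fixed width. The function name `to_fasta`, the record name and the width of 60 are assumptions for illustration; the Staden package performs this export itself.

```python
def to_fasta(name, sequence, width=60):
    """Format one sequence as a FastA record with lines of at most `width`."""
    lines = [">" + name]
    for start in range(0, len(sequence), width):
        lines.append(sequence[start:start + width])
    return "\n".join(lines) + "\n"


# Hypothetical contig name; the sequence is the first 60 nt of SEQ ID NO: 1.
record = to_fasta(
    "contig_example",
    "gaactaagcaagtgttacatggataaagtgagctatgctctgggattgagcatcggtaat",
)
print(record, end="")
```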

The results are set out in Table 1. 

It will be appreciated by persons skilled in the art that numerous
variations and/or modifications may be made to the invention as shown in
the specific embodiments without departing from the spirit or scope of the
invention as broadly described. The present embodiments are, therefore, to
be considered in all respects as illustrative and not restrictive.

Dated this 23rd day of April 1998 



CSL LIMITED 

Patent Attorneys for the Applicant: 
F.B. RICE & CO. 
(2) INFORMATION FOR SEQ ID NO: 1



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 603 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: UNKNOWN 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...603

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1



gaactaagca agtgttacat ggataaagtg agctatgctc tgggattgag catcggtaat 60

aatttcaagt cttcgggcat cgacagcgtc gttatggatg atttcatgca aggtctgtct 120

gatgtactgg aagaaaaagc ccctcagctc tcgtatgacg aggccaagcg cgaaatagag 180

gcgtatttca tggatttgca gcagaaggct gtcaaactga acaaagaggc cggagaagaa 240

ttcctcaaga taaatgcaca caaggaaggt gtgacgacct taccgagcgg cttgcaatac 300

gaagtcatta agatgggaga gggcccgaaa cccacccttt cggacacggt aacctgtcat 360

tatcacggta cgctcatcaa cggtatcgtt ttcgatagct ctatggacag gggagaaccg 420

gccagtttcc ctctaagagg agttatagcc ggctggacgg agattcttca attaatgcct 480

gtaggatcca agtggaaagt aactataccg agcgatctgg cgtatggaga tcgtggtgcc 540

ggcgaacata tcaaaccggg tagtacgctc atttttataa tcgaattatt gagtatcaac 600

aaa 603



(2) INFORMATION FOR SEQ ID NO: 2



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: UNKNOWN



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1119

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

gggtcttgtg gaagtagccc agcaattggc cgatcagggt gttcgcgttg tgatcgccgg 60 

attggacatg gactttcgac gtcagccttt cggacctatg ccgggcttgt gtgccatagc 120 

cgactccgtg accaaagttc atgccgtgtg tgtggaatgc ggccgattgg ccagctattc 180 

tttccgtcgt gtccaaggcg atcagcaagt gatgctgggc gaactgaacg aatacagtcc 240 

cctctgcaga acctgctaca ggaaatgcag ttctccccca caaacagaag aaatccattc 300

gacaatatga atagcagaca tctgacaatc acaatcattg ccggcctctc cctctttgta 360 

ctgacattgg gcggctgctc cgtagcccaa caagatacgc agtggactct cggcggaaag 420 

ctctttactt cggcgtggat acaacgttcg gccgaatatc aagcgctttg cattcaggca 480 

tacaacatcg ctacggaaag agtggacgct ctaccggcag aacgtaaaca aggagatagg 540 

ccttatgcca tcgtaacgga catagacgaa accattttgg acaatacgcc taactccgtg 600 

tatcaggctc tcaggggcaa ggattatgat gaagagactt gggggaaatg gtgtgcacag 660 

gccgatgccg acacactggc aggagctttg tctttcttcc tccatgcagc gaacaagggg 720 

atcgaggtct tttacgtcac caaccgcaga gacaatctgc gcgaagcaac tcttcagaac 780 

cttcagcgtt acggattccc ctttgccgat gaagaacatt tgcttacgac ccatgggcca 840 

tccgacaaag aaccccgtcg gctcaaaata caagaacagt atgaaatagt attgctcata 900 

ggagacaact tgggcgactt ccaccacttc ttcaatacga aagaagagtc cggacgcaaa 960 

caggctctgg gcctgacagc cggggagttt ggccggcact tcatcatgct gcccaatccc 1020 

aactacggat cttgggaacc ggcatggtac ggcgggaagt atccgccact gcccgaaaga 1080 

gacaaagcac ttaaacaact gcactcacag aacagcaga 1119 



(2) INFORMATION FOR SEQ ID NO: 3

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: UNKNOWN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...933

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3



ccatataatg tccaatctat tagcaacaag acgattaaaa aacaaatgga aaacttaaag 60 

aacattcagc ccagagagga tttcaactgg gaagagtttg aggccggtgg cgtccatgct 120 

gccgtgagtc gtcaggagca ggaagctgct tatgacaaaa cgctcaatac catcaaggaa 180 

aaggaagtgg taatgggtag ggtaactgct atcaacaagc gtgaagtggt tatcaatgta 240 

gggtacaaat cggaaggtgt ggtacctgca acagaattcc gctacaatcc cgaactcaaa 300 

gtgggagacg aagtggaagt ttatatcgag aatcaggaag ataagaaggg ccagctcgtc 360 

ttgtctcacc gcaaggctcg tgccgctcgc tcttgggagc gcgtgaacga ggctctcgaa 420 

aaagacgaaa tcgtaaaggg ctatgtgaag tgtcgtacca agggtggtat gatcgtcgat 480 

gtattcggta tcgaggcttt cctcccggga tcacagatcg acgtgcgccc cattcgcgac 540 

tacgatgcat tcgttgagaa gacgatggag ttcaagattg tgaaaatcaa tcaagaatat 600 

aagaatgtag ttgtttccca caaggtgctc atcgaagcag agctcgaaca acagaagaaa 660 

gaaatcatcg gcaagctcga aaaagggcag gtactcgaag gtatcgtcaa gaatattact 720 

tcttacggag tatttatcga cctcggtgga gtggatggtc ttatccatat cactgacctt 780 

tcatggggtc gtgtggctca tccggaagaa atcgtacagc tggatcagaa gatcaatgtc 840 

gttatcctcg actttgatga agatcgcaag cgtatcgctc tcggactcaa acagctgatg 900 

ccctcatcct tgggatgctc tcgacagcga gct 933



(2) INFORMATION FOR SEQ ID NO: 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2955 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: UNKNOWN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...2955

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 

gtaacagaca aaatgcaaaa caaaggattt gtgattgtta tcacatcggc tctggccatc 60 

atctgtgcgt tttacctgtc attctctttc gttacgaacc gttacgaaaa gaaggctaag 120 

gcgatgggcg atgttgccgg aatggcctat cttgattcca tgtcgaatga gaaggtctgg 180 

ttcggctaca cgctgaaaga agctcaagcc cagcaaattg gtcttggcct tgacttaaag 240 

gggggtatga acgttatctt gaaacttaac gcaagcgatc tgcttcgtaa cctctctaac 300 

aaaagtttgg atcccaactt caacaaagct ctggagaatg ctgccaagag cacggagcaa 360 

tccgacttca tcgatatttt cgtgaaggaa tatcgcaagc tcgatcccaa cggtcgcttg 420 

gccgttatct tcggttcggg tgaccttcgc gaccagatta ccgcaaagtc tacggatgca 480 

gacgtagtgc gtctgctcaa agaaaaatat aatagtgctg tagaagcttc gttcaatgtg 540 

ctccgtgctc gtatcgatgc tttcggtgtg gttgcaccta atttgcagcg attggaaggg 600 

caagggcgta tccttgtcga actccccgga gtgaaagacc ctgagcgtgt tcgtaccctt 660 

ttgcaacgca gtgccaacct acagttctgg cgtacataca aattcgaaga ggtcagcgga 720 

gacttgatcg ctgccaatga tcgtctgagc gaattggcta tgaacaacac ggatgctacc 780 

ccggaaacag agcctgcaac tactgactct gtagctgcaa cagccgattc tgctgctgta 840 

caagctgtag ctgattctgc tactgtagca caaaaagagg ccaaggatgc tactcgtaaa 900 

gacgcactct tctctctgct tactcccgtg aatcgtggcg gtgcagtagt gggtgtggct 960 

cgtcgtgcta atatggctca gatatctgaa atgctccagc aagctcacga tctgaaggtt 1020 

acacgtgaag atgtgctttt cctctggggt gctaaagcaa tcgaagaccc cgaaaccaaa 1080 

aaggagaccg acctctacga actctatgct attcgtacca atcgtacggg agatcctgat 1140 

ttgggaggtg atgtagtgac ttccgccaag agtgatatcc aaaatgactt cggtcgttcc 1200 

gaaccgatcg tttcgatgac gatgaatgaa gaaggtgctc gtaaatgggc gcgtatcaca 1260 

aaggataacg tgggacgggc aatcgctatc gttttggatg gtgtggttta ttctgctccg 1320 

aacgtgaatg atgagatcac gggcggtcgc tctcagatct ccgggcactt caccgtggag 1380 

gaggccggtg accttgccaa cgtactcaac tccggtaaaa tggatgctac ggtaagcatc 1440 

gaacaggaaa acgtgattgg tcctacgctg ggtgccgagt ccattaaagc cggattcttg 1500 

tcgttcctgc tcgctttggt tatcctgatg tgttacatgt gtctggctta cggtttcttg 1560 

ccgggtctta tcgcaaacgg cgcattgatt gtaaacagct tcttcacatt gggcgtattg 1620 

gcttctttcc atgccgtgct gaccctctcg ggtatcgcag gtttggtgct gacgctgggt 1680 

atggctgtgg atgccaacgt acttatcttc gagcgtatca aagaagagct tcgtgccggt 1740 

aagactccga ttcgtgccgt tacggatggt tatggcaacg ctttctctgc catcttcgac 1800 

tcgaacgtta cgactattat taccggtatc atcctattcc tctacgggac ggggccgatt 1860 

cgcggttttg ccactacgtt gattatcggt cttatcgctt ctttcattac ggctgtcttc 1920 

ttgactcgta tcgtcttcga gaaactggcg aaaaaaggtc gtttggataa gattacattc 1980 

actacgagca ttactcgcaa tctccttgtc aatccctcat acaacatctt gggtaagcgc 2040 

aagaccggct ttatcattcc ggtgattatc atcgttttgg gacttatagc ttcatttaca 2100 

atcggtctca ataggggtat tgaattctcc ggaggacgta actacgtagt taaattcgac 2160 

cagcctgtat cttccgaagc cgttcgttcg gccttgtctt ctcccctgca ggaaaaggta 2220 

ttggttacct ccatcggtac tgaagggaca gaggtgcgta tatctacgaa ctataagatc 2280 

caggaggaaa gcgaagaaac tgaagcagag attactgaca aattgtatca gagcctgaaa 2340 

ggtttctaca cccagcagcc tactgctgat cagttcttgg acaatatcat tagctctcag 2400 

aaagtaagtc ccagtatgtc gagtgacatc acgagaggtg ctatttgggc tgtgctgtta 2460 

tcgatgatct tcatggccat ttatattctg attcgcttcc gtgacatttc tttctctgcc 2520 

ggggtattcg tatctgtggc cgctactaca ttctgcatta ttgctctgta tgcgttgctg 2580 

tggaagattc tgcccttcac catggagatc gatcagaact tcatcgctgc tattctggct 2640 

atcatcggtt actcgctcaa tgacaccgtg gttgtatttg accgtatccg agagacgatg 2700 

aaattgtacc ccaacagaga tcgctatcag gtgatcaacg atgcccttaa ttcaacattg 2760 

ggtcgaacat taaatacgtc tttgactacg tttatcgtta tgttggtaat cttcatcttt 2820 

ggaggtgcta cgatgcgtag tttcacgttc tcgatcctgc tcggtatcgt tatcggtaca 2880
tactctacgc tctttgttgc tacacccctt gcctacgaga tccaaaagcg caagctcaac 2940 
aaagcagcta agaaa 2955 



(2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: UNKNOWN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...318

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5



ctttcgtaca gtggagagag cgatgcaaaa gagtctgatc agaattgccg gaaatgtacc 60 

ttcatcggct ttgaaaaacg agtaaatacg atgcgactga tcaaggcttt tctcgtgcaa 120 

ctcttactgc tccccatttt cttctacaag cggtttatat cgccgcttac accgccttca 180 

tgccggttta ccccctcatg ttcgtcctat gccatcgaag ccttacgtaa atatggcccg 240 

ggcaaaggac tattgctgag catcaagcgt attctccgct gtcacccgtg gggtggaagt 300 

ggctatgacc ccgttccg 318 



(2) INFORMATION FOR SEQ ID NO: 6

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1368 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: UNKNOWN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1368

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6

agaggaggac aaatccgccg acaccatacc gattcgtcaa ggggatcgga cagcaaagcc 60 

tgccgcttct cctttcatgc cgagaccatc ggtttctcca accaccaaag agcaaaaatg 120 

aattacctgt acatactgat tacactttta ctctccggct ttttttccgg tgctgagatt 180 

gctttccttt cttcggacaa actgcgtctt gagttggaca ggaatagagg cgatctcaca 240 

ggcagagcgt taaacttgct gtatcgacat ccggaccaac tggtgactac tctccttgtg 300 

ggtaataata tcgttttggt agtctatggt ctgctgatgg cgggattgct ggccgcacct 360

ttggcgcaat ggattgataa cgatgctatg atcgtcgttc tccaatctgt cttatccact 420

atcatcatac tgtttaccgg ggaatttcta cccaaagcca ttttcaagac caatgccaat 480

atgatgatga gggtattcgc cctccctatc gtagcgatct attatctgct ttatcctctg 540

tctaaactct tcaccggttt atctcgctct tttattcgtc tggtggacaa gaattatgtg 600

cctacaacag tagggttggg gcgcgtagat ctcgatcatt atttggcaga aaatatgtcc 660

ggagaaaacg aacagaacga cttgactacc gaagtgaaaa tcatccagaa tgcgctggat 720

ttttccggta ttcaggtgcg agactgcatg atcccacgca atgagatgat agcatgtgag 780

ttgcaaacgg atattgaagt actcaaaacg acttttatcg ataccggttt gtccaagatc 840

attatctaca gacagaacat agatgacgta gtaggatata tccattcgag cgaaatgttt 900

cgtgggcaag actggcaaaa acgtatcaat actactgtat tcgtacccga aagcatgtat 960

gccaataaac tgatgcgact actcatgcag cgcaagaaaa gcattgcgat cgtcatcgat 1020

gaacttggag gtacggccgg aatggtcaca ttagaggatt tggtagaaga gattttcggt 1080

gacattgagg acgaacacga cactcgcaag atcatagcca aacagctcgg ccctcatacc 1140

tatctggtca gtggtcgtat ggaaatagat gatgtgaacg aacgttttgg gttgtccttg 1200

cctgagtctg acgactacct taccgtggcc ggatttatcc tgaatagcca tcaaaatatc 1260

ccacaggcca atgaggtcgt ggagattgct ccttatactt ttaccattct cagatcttct 1320

tccaccaaga tcgaactggt gaaaatgtcc atcgacgacc aatcgaac 1368



(2) INFORMATION FOR SEQ ID NO: 7 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2037 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: UNKNOWN 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...2037

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7



cctatccctt ttcttcttaa tatgtacagc ggacatcata aaatccatta tccttttctt 60 

atcctgttgg tatgccttgc ttttgctgcc tgcaagagcg tgaagttgaa agatgcggag 120 

aaggcacatg atcgccaaga gtataccaag gctgccgata tgtacaatac attatacagg 180 

cgtacccgac gaaagcaggt ggagatgaaa gcttatacgg ctttccgatc cggtgaaaac 240 

tatcgtgccg ccggcagaca agccaaagct ttgcgtggct atctgaatgc cagacgctac 300 

gggtatccgg attctgtggt actgctccgt ttggcacaga cttatcagca aggaggtaac 360 

tataaggaag ccgaggtact cttccgtgga tatctggaag cttatccgaa aagttatttt 420 

gcagctatcg gtttggaggg gtgtctcttt gcccgccagc aaaaggaata tcctacacgt 480 

taccggatac ggcgagctgc cgagtggaat tcggcacggg gcgacttcgg cccggcctat 540 

gcacccgatg cttcggctct ctatttcaca tcgagcagaa gcaaagacga cggtttggat 600 

aatagcagca taacgggact gaaacccaac gacatttata tcatcaaacg agatgcacaa 660 

ggacgatggg gacgtcccga tagcgtgtcc ggaggaatca acactccatg ggatgaaggc 720 

gtgccaacga tcacgcccga tggtagtacc atatattata cgttggcgca gcaaggagcc 780 

gattacgacc gtacggtaca gatctattcc gccgctcgga gcggagaagg cggttggagc 840 

aacggttcgc tcgtggacat tatgcgcgat tcgctccgta tggctgctca tccctctatg 900 

tcggcatccg gcgattacct gtatttcgtc agcaatatag gcggtagcta tggcggcaag 960 

gatatttatc gtgtcaaggt gtcggatcgt tcttatggtt caccggagaa tttggggcct 1020 

gatatcaata cgccggggga cgaaatgttt cccttcatag atggggatag tacccttttc 1080 

ttcgcttcgg acggacacgc cggtctggga ggactggata ttttcaaagc cacgctggac 1140 

tctaccggcc aatggcatgt agtcaatatg ggacaaccgg tcaattcctc tgccgatgat 1200 

ttcggcttgg ctgtggagcc taaaggcaaa aacaaagaag aagctttgcc ggacaacgga 1260 

gtcaaaggtg tattttgttc caaccgaggc gatgcacgcg gatggccgca cctcttccat 1320

ttcgaactgc cggctatcta caccgagatt caaggttatg tgatggacag agaagaaaat 1380 

cccatagccg gagccactgt caggatcgta ggcgaacgcg gccccgtagg acagggattc 1440 

gtgactactc gtgacgatgg ctcctataag atgagcgtgc agggcgatac tcgctatgta 1500 

atgcttgccg gagcatcggg ttatttgaat cagtacgtag aactcaagac cgataccgcc 1560 

aagcagagtg agacctacta tgtggacttt ttccttgcat cgcgtgagaa agccgagggc 1620 

ttgcaaaata ttttctatga tttcgataaa gctactcttc gccccgaaag catgaagagc 1680 

ttggacgaac tgattcgtat cctcacggac aatccggata ttcggatcga attgggttcg 1740 

catgccgaca ggaaaggccc cgatgcttac aacctcggac tatctgaccg cagagccaaa 1800 

tccgtggtgg attacctcac gagtcgtggc atagcggccg acaggcttac gtggaaaggc 1860 

tacggtaagt ctgtccccaa gacggtgaca gccaaaattg ccgaacggca cgatttcctg 1920 

aaggaaggcg atgtgctcac cgaggaattc gtagcacctt tgaccgagga gcagcagtca 1980 

gtctgcgacc aactgaaccg tcgtaccgag ttccgtgtga tcgaagaaga gttgcgt 2037 



(2) INFORMATION FOR SEQ ID NO: 8

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: UNKNOWN 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1011

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8



cctcagcccg tcggccttaa agaaataacc attaaaccca tgtgcctcga acccataatt 60 

gctccgattt catccgagtt gctcgagcag gagctgactg ccgatcgttt tctgcggatg 120 

acaaacaaag ccggcaatga gatctatgtt tttacggccg aagaagctcc gcattgcatg 180 

aaagaagtag gccgactgcg agaagaagcc tttcggcatt atggcggagg tactggcaag 240 

gcgatcgata tagacgagtt cgacaccatg cccgggagct acaaacagct gatcgtatgg 300 

gatccgcaaa acaaggctat actcggaggc taccgcttta tctatgggcg ggacgttgct 360 

ttcgataccg atggcaagcc tttgctggca acggcagaga tgtttcgctt cagtgatgct 420 

tttttgcacg attatctccc ctacacagtc gaattgggac gttcgttcgt gtcgctccag 480 

taccaatcga cacggatggg cacaaaggcc atttttgtgc tggacaatct ttgggacggt 540 

atcggagcac tcactgtagt caatccagag gcactctatt tctatggcaa ggtgaccatg 600 

tacaaagact atgatcggcg agctcgcaat ctgatcctgt attttcttcg caagcacttc 660 

tccgatccgg aaggcttggt caagcctatt catcccctac cgatagagat cagtgcggag 720 

gacgaagcct tgttctcctc atccgacttt gacaccaatt acaagactct caatatagaa 780 

gtgcgcaagc tgggtatcaa tatccctcct ctcgtgagtg catatatagc tttgtctccg 840 

gagatgcgtg ttttcggcac tgcagtgaat gagtctttcg gagaggtgga ggaaaccggc 900 

atattcattg ctgtgggtaa gatcctggaa gagaaaaaac aacggcacat agagagcttc 960 

atcctcagcc ggaacgaaaa aaaaggtctc gacagtagca atggccgatc a 1011 



(2) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: UNKNOWN

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...972

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 



ctaacgttgt tttgttgcaa ctatttcaaa cagatgagag caaacatttg gcagatactt 60 

tccgtttcgg ttctcttttt cttcgggaca gcgatcggac aggctcagag tcgaaaccgt 120 

acatacgagg cttatgtgaa acagtacgcc gacgaagcta tccgacagat gagccgctac 180 

aatataccgg caagcatcac catagcacag gctttggtgg agacaggagc cggagccagt 240

acactggcca gcgtacacaa caatcacttc gggatcaaat gccacaaatc gtggacgggc 300 

aagcgcacct atcgtaccga cgatgcgccg aacgaatgct tccgcagcta ttcggccgct 360 

cgcgaatcgt atgaagatca ttcccgattt ctgctccaac cacgctatcg tcccctgttc 420 

aaactcgaca gagaagacta tcggggctgg gctacggggt tgcaacgctg tggctatgcc 480 

accaatcggg gctatgccaa tctgctgatc aagatggtgg agctgtatga gctatatgct 540 

ttggatcgcg agaagtaccc ctcatggttc cacaagtctt accccgggtc caacaaaaaa 600 

tcccatcaaa cgaccaagca gaagcagagc ggactcaagc acgaagctta cttcagctac 660 

ggactgctct acatcatagc caagcaaggc gataccttcg attctttggc cgaagagttc 720 

gacatgagag cctccaaact ggccaaatac aacgatgctc ccgtggattt cccgatcgaa 780 

aagggcgatg tgatctatct ggagaaaaag cacgcatgct ccatctccaa acacacacag 840

cacgtagtgc gtgtgggcga ttcgatgcac agtatctccc aacgctatgg catccggatg 900 

aagaacctct acaagctcaa cgacaaggat ggcgaatata taccccaaga gggcgatata 960 

ctgcgcttgc gc 972 
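
By way of illustration only, the running counts printed at the end of each nucleotide line can be used to check a transcription of the listing. The following is a minimal sketch in Python; the function name `read_nucleotide_listing` is hypothetical and is not part of the specification. It assumes that each line holds blocks of lowercase bases followed by the cumulative base count, and that a count may have been split by stray spaces.

```python
import re


def read_nucleotide_listing(lines):
    """Join the base blocks of a listing and check each running count.

    Assumption: every non-empty line holds blocks of lowercase bases
    followed by the cumulative number of bases up to the end of that line.
    """
    sequence = ""
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # blank separator line
        bases = "".join(t for t in tokens if re.fullmatch(r"[acgtn]+", t))
        # A count may be split by stray spaces (e.g. "24 0"), so rejoin digits.
        count = "".join(t for t in tokens if t.isdigit())
        sequence += bases
        if count and int(count) != len(sequence):
            raise ValueError(f"running count {count} != {len(sequence)} bases")
    return sequence


# First two lines of SEQ ID NO: 9 as given above.
first_lines = [
    "ctaacgttgt tttgttgcaa ctatttcaaa cagatgagag caaacatttg gcagatactt 60",
    "",
    "tccgtttcgg ttctcttttt cttcgggaca gcgatcggac aggctcagag tcgaaaccgt 120",
]
seq = read_nucleotide_listing(first_lines)
print(len(seq))  # 120
```

Applied to all lines of a sequence, the length of the returned string may be compared with the LENGTH entry of the corresponding SEQUENCE CHARACTERISTICS.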



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...201 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Glu Leu Ser Lys Cys Tyr Met Asp Lys Val Ser Tyr Ala Leu Gly Leu
1 5 10 15

Ser Ile Gly Asn Asn Phe Lys Ser Ser Gly Ile Asp Ser Val Val Met
20 25 30

Asp Asp Phe Met Gln Gly Leu Ser Asp Val Leu Glu Glu Lys Ala Pro
35 40 45

Gln Leu Ser Tyr Asp Glu Ala Lys Arg Glu Ile Glu Ala Tyr Phe Met
50 55 60

Asp


Leu 


bin 




Lys 


Ala 


Val 


Lys 


Leu 


Asn 


Lys 


Glu 


Ala 


Gly 


Glu 


Glu 


65 




















7 5 








80 




Leu 


Lys 


i i e 


As n 


ax a 


His 


Lys 


Glu 


Gly 


Val 


Thr 


Thr 


Leu 


Pro 


Ser 










R R 
O 3 










90 










95 




fZ~\ \r 
J 


Leu 


bin 


Tyr 


<j1U 


val 


lie 


Lys 


Met 


Gly 


Glu 


Gly 


Pro 


Lys 


Pro 


Thr 








i no 










inc. 










110 






LeU 


Ser 


.Asp 




v a j_ 


XIII. 


Cys 


nl S 


Tyr 


TJ -i 

Hi S 


bly 


Thr 


Leu 


He 


Asn 


Gly 






115 










i on 

l^U 










l^o 






lie 


Val 


Phe 


A.S p 


o J- 


O t= I_ 




Asp 


Arg 


biy 


Glu 


Pro 


Ala 


Ser 


Phe 


Pro 




130 










135 










1 H U 










Leu 


Arg 


Gly 


Val 


Ile 






i rp 


i nr 


bill 


lie 


Leu 


<jin 


Leu 


Met 


Pro 


14 5 










J. o u 










ICC 

loo 










160 


Vdl 


«i y 




Lys 


Trp 


Lys 


val 


l nr 


He 


Pro 


Ser 


Asp 


Leu 


Ala 


Tyr 


Gly 










165 










170 










175 


Asp 


Arg 


Gly 


Ala 


Gly 


Glu 


His 


He 


Lys 


Pro 


Gly 


Ser 


Thr 


Leu 


He 


Phe 








180 










185 










190 






He 


He 


Glu 


Leu 


Leu 


Ser 


He 


Asn 


Lys 




















195 










200 

















(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...373



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 



Gly Ser Cys Gly Ser Ser Pro Ala Ile Gly Arg Ser Gly Cys Ser Arg
1 5 10 15

Cys Asp Arg Arg Ile Gly His Gly Leu Ser Thr Ser Ala Phe Arg Thr
20 25 30

Tyr Ala Gly Leu Val Cys His Ser Arg Leu Arg Asp Gln Ser Ser Cys
35 40 45

Arg Val Cys Gly Met Arg Pro Ile Gly Gln Leu Phe Phe Pro Ser Cys
50 55 60

Pro Arg Arg Ser Ala Ser Asp Ala Gly Arg Thr Glu Arg Ile Gln Ser
65 70 75 80

Pro Leu Gln Asn Leu Leu Gln Glu Met Gln Phe Ser Pro Thr Asn Arg
85 90 95

Arg Asn Pro Phe Asp Asn Met Asn Ser Arg His Leu Thr Ile Thr Ile
100 105 110

Ile Ala Gly Leu Ser Leu Phe Val Leu Thr Leu Gly Gly Cys Ser Val
115 120 125

Ala Gln Gln Asp Thr Gln Trp Thr Leu Gly Gly Lys Leu Phe Thr Ser
130 135 140

Ala Trp Ile Gln Arg Ser Ala Glu Tyr Gln Ala Leu Cys Ile Gln Ala
145 150 155 160

Tyr Asn Ile Ala Thr Glu Arg Val Asp Ala Leu Pro Ala Glu Arg Lys
165 170 175

Gln Gly Asp Arg Pro Tyr Ala Ile Val Thr Asp Ile Asp Glu Thr Ile
180 185 190

Leu Asp Asn Thr Pro Asn Ser Val Tyr Gln Ala Leu Arg Gly Lys Asp
195 200 205

Tyr Asp Glu Glu Thr Trp Gly Lys Trp Cys Ala Gln Ala Asp Ala Asp
210 215 220

Thr Leu Ala Gly Ala Leu Ser Phe Phe Leu His Ala Ala Asn Lys Gly
225 230 235 240

Ile Glu Val Phe Tyr Val Thr Asn Arg Arg Asp Asn Leu Arg Glu Ala
245 250 255

Thr Leu Gln Asn Leu Gln Arg Tyr Gly Phe Pro Phe Ala Asp Glu Glu
260 265 270

His Leu Leu Thr Thr His Gly Pro Ser Asp Lys Glu Pro Arg Arg Leu
275 280 285

Lys Ile Gln Glu Gln Tyr Glu Ile Val Leu Leu Ile Gly Asp Asn Leu
290 295 300

Gly Asp Phe His His Phe Phe Asn Thr Lys Glu Glu Ser Gly Arg Lys
305 310 315 320

Gln Ala Leu Gly Leu Thr Ala Gly Glu Phe Gly Arg His Phe Ile Met
325 330 335

Leu Pro Asn Pro Asn Tyr Gly Ser Trp Glu Pro Ala Trp Tyr Gly Gly
340 345 350

Lys Tyr Pro Pro Leu Pro Glu Arg Asp Lys Ala Leu Lys Gln Leu His
355 360 365

Ser Gln Asn Ser Arg
370



(2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...311



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



Pro Tyr Asn Val Gln Ser Ile Ser Asn Lys Thr Ile Lys Lys Gln Met
1 5 10 15

Glu Asn Leu Lys Asn Ile Gln Pro Arg Glu Asp Phe Asn Trp Glu Glu
20 25 30

Phe Glu Ala Gly Gly Val His Ala Ala Val Ser Arg Gln Glu Gln Glu
35 40 45

Ala Ala Tyr Asp Lys Thr Leu Asn Thr Ile Lys Glu Lys Glu Val Val
50 55 60

Met Gly Arg Val Thr Ala Ile Asn Lys Arg Glu Val Val Ile Asn Val
65 70 75 80

Gly Tyr Lys Ser Glu Gly Val Val Pro Ala Thr Glu Phe Arg Tyr Asn
85 90 95

Pro Glu Leu Lys Val Gly Asp Glu Val Glu Val Tyr Ile Glu Asn Gln
100 105 110

Glu Asp Lys Lys Gly Gln Leu Val Leu Ser His Arg Lys Ala Arg Ala
115 120 125

Ala Arg Ser Trp Glu Arg Val Asn Glu Ala Leu Glu Lys Asp Glu Ile
130 135 140

Val Lys Gly Tyr Val Lys Cys Arg Thr Lys Gly Gly Met Ile Val Asp
145 150 155 160

Val Phe Gly Ile Glu Ala Phe Leu Pro Gly Ser Gln Ile Asp Val Arg
165 170 175

Pro Ile Arg Asp Tyr Asp Ala Phe Val Glu Lys Thr Met Glu Phe Lys
180 185 190

Ile Val Lys Ile Asn Gln Glu Tyr Lys Asn Val Val Val Ser His Lys
195 200 205

Val Leu Ile Glu Ala Glu Leu Glu Gln Gln Lys Lys Glu Ile Ile Gly
210 215 220

Lys Leu Glu Lys Gly Gln Val Leu Glu Gly Ile Val Lys Asn Ile Thr
225 230 235 240

Ser Tyr Gly Val Phe Ile Asp Leu Gly Gly Val Asp Gly Leu Ile His
245 250 255

Ile Thr Asp Leu Ser Trp Gly Arg Val Ala His Pro Glu Glu Ile Val
260 265 270

Gln Leu Asp Gln Lys Ile Asn Val Val Ile Leu Asp Phe Asp Glu Asp
275 280 285

Arg Lys Arg Ile Ala Leu Gly Leu Lys Gln Leu Met Pro Ser Ser Leu
290 295 300

Gly Cys Ser Arg Gln Arg Ala
305 310























(2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...985



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 



Val Thr Asp Lys Met Gln Asn Lys Gly Phe Val Ile Val Ile Thr Ser
1 5 10 15

Ala Leu Ala Ile Ile Cys Ala Phe Tyr Leu Ser Phe Ser Phe Val Thr
20 25 30

Asn Arg Tyr Glu Lys Lys Ala Lys Ala Met Gly Asp Val Ala Gly Met
35 40 45

Ala Tyr Leu Asp Ser Met Ser Asn Glu Lys Val Trp Phe Gly Tyr Thr
50 55 60

Leu Lys Glu Ala Gln Ala Gln Gln Ile Gly Leu Gly Leu Asp Leu Lys
65 70 75 80

Gly Gly Met Asn Val Ile Leu Lys Leu Asn Ala Ser Asp Leu Leu Arg
85 90 95

Asn Leu Ser Asn Lys Ser Leu Asp Pro Asn Phe Asn Lys Ala Leu Glu
100 105 110

Asn Ala Ala Lys Ser Thr Glu Gln Ser Asp Phe Ile Asp Ile Phe Val
115 120 125

Lys Glu Tyr Arg Lys Leu Asp Pro Asn Gly Arg Leu Ala Val Ile Phe
130 135 140

Gly Ser Gly Asp Leu Arg Asp Gln Ile Thr Ala Lys Ser Thr Asp Ala
145 150 155 160



Asp Val Val Arg Leu Leu Lys Glu Lys Tyr Asn Ser Ala Val Glu Ala
165 170 175

Ser Phe Asn Val Leu Arg Ala Arg Ile Asp Ala Phe Gly Val Val Ala
180 185 190

Pro Asn Leu Gln Arg Leu Glu Gly Gln Gly Arg Ile Leu Val Glu Leu
195 200 205

Pro Gly Val Lys Asp Pro Glu Arg Val Arg Thr Leu Leu Gln Arg Ser
210 215 220

Ala Asn Leu Gln Phe Trp Arg Thr Tyr Lys Phe Glu Glu Val Ser Gly
225 230 235 240

Asp Leu Ile Ala Ala Asn Asp Arg Leu Ser Glu Leu Ala Met Asn Asn
245 250 255

Thr Asp Ala Thr Pro Glu Thr Glu Pro Ala Thr Thr Asp Ser Val Ala
260 265 270

Ala Thr Ala Asp Ser Ala Ala Val Gln Ala Val Ala Asp Ser Ala Thr
275 280 285

Val Ala Gln Lys Glu Ala Lys Asp Ala Thr Arg Lys Asp Ala Leu Phe
290 295 300

Ser Leu Leu Thr Pro Val Asn Arg Gly Gly Ala Val Val Gly Val Ala
305 310 315 320

Arg Arg Ala Asn Met Ala Gln Ile Ser Glu Met Leu Gln Gln Ala His
325 330 335

Asp Leu Lys Val Thr Arg Glu Asp Val Leu Phe Leu Trp Gly Ala Lys
340 345 350

Ala Ile Glu Asp Pro Glu Thr Lys Lys Glu Thr Asp Leu Tyr Glu Leu
355 360 365

Tyr Ala Ile Arg Thr Asn Arg Thr Gly Asp Pro Asp Leu Gly Gly Asp
370 375 380

Val Val Thr Ser Ala Lys Ser Asp Ile Gln Asn Asp Phe Gly Arg Ser
385 390 395 400

Glu Pro Ile Val Ser Met Thr Met Asn Glu Glu Gly Ala Arg Lys Trp
405 410 415

Ala Arg Ile Thr Lys Asp Asn Val Gly Arg Ala Ile Ala Ile Val Leu
420 425 430

Asp Gly Val Val Tyr Ser Ala Pro Asn Val Asn Asp Glu Ile Thr Gly
435 440 445

Gly Arg Ser Gln Ile Ser Gly His Phe Thr Val Glu Glu Ala Gly Asp
450 455 460

Leu Ala Asn Val Leu Asn Ser Gly Lys Met Asp Ala Thr Val Ser Ile
465 470 475 480

Glu Gln Glu Asn Val Ile Gly Pro Thr Leu Gly Ala Glu Ser Ile Lys
485 490 495

Ala Gly Phe Leu Ser Phe Leu Leu Ala Leu Val Ile Leu Met Cys Tyr
500 505 510

Met Cys Leu Ala Tyr Gly Phe Leu Pro Gly Leu Ile Ala Asn Gly Ala
515 520 525

Leu Ile Val Asn Ser Phe Phe Thr Leu Gly Val Leu Ala Ser Phe His
530 535 540

Ala Val Leu Thr Leu Ser Gly Ile Ala Gly Leu Val Leu Thr Leu Gly
545 550 555 560

Met Ala Val Asp Ala Asn Val Leu Ile Phe Glu Arg Ile Lys Glu Glu
565 570 575

Leu Arg Ala Gly Lys Thr Pro Ile Arg Ala Val Thr Asp Gly Tyr Gly
580 585 590

Asn Ala Phe Ser Ala Ile Phe Asp Ser Asn Val Thr Thr Ile Ile Thr
595 600 605

Gly Ile Ile Leu Phe Leu Tyr Gly Thr Gly Pro Ile Arg Gly Phe Ala
610 615 620

Thr Thr Leu Ile Ile Gly Leu Ile Ala Ser Phe Ile Thr Ala Val Phe
625 630 635 640

Leu Thr Arg Ile Val Phe Glu Lys Leu Ala Lys Lys Gly Arg Leu Asp
645 650 655

Lys Ile Thr Phe Thr Thr Ser Ile Thr Arg Asn Leu Leu Val Asn Pro
660 665 670

Ser


T \7 r 


As n 


Tip 

_L _L fc: 


Leu 


ol y 


Lys 


A^r g 


Lys 


Thr 


Gly 


IT IlC 


Tip 

lie 


T I p 
11C 


Pro 


Val 






675 










680 










685 








lie 


lie 


lie 


Val 


Leu 


Gly 


Leu 


He 


Ala 


Ser 


Phe 


Thr 


He 


Glv 


Leu 


Asn 




690 










695 










700 










Arg 


Glv 


lie 


Glu 


Phe 


Ser 


Glv 


Glv 


Arg 


As n 


Tvr 


Val 


Val 


Lys 


Phe 


Asp 


705 










710 










715 










720 


Gin 


Pro 


Val 


Ser 


Ser 


Glu 


Ala 


Val 


Arg 


Ser 


Ala 


Leu 


Ser 


Ser 


Pro 


Leu 










725 










730 










735 




(Z 1 n 

Olil 


\J _L U 


Lys 


Val 


Leu 


Val 


Thr 


Ser 


He 


Gly 


Thr 


Glu 


Gly 


Thr 


Glu 


Val 








740 










745 










750 






Arg 


lie 


Ser 


Thr 


As n 


Tvr 

j 


Lys 


He 


Gin 


Glu 


Glu 


Ser 


Glu 


Glu 


Thr 


Glu 






755 










760 










765 








Al * 
rVJ. d 


rz 1 n 

ul u 


He 


Thr 


As p 


Lys 


Leu 




Gin 


Ser 


Leu 


Lys 


Gly 


Phe 


Tvr 
x y i 


Thr 




77 0 










775 










780 










Gin 


Gin 


Pro 


Thr 


Ala 


As p 


Gin 


Phe 


Leu 


As p 


As n 


He 


He 


Ser 


Ser 


Gin 


7 85 










790 










795 










800 


Lys 


Val 


Ser 


Pro 


Ser 


Met 


Ser 


Ser 


Asp 


He 


Thr 


Arg 


Glv 


Ala 


He 


Trr> 










805 










810 










815 




Ala 


Val 


Leu 


Leu 


Ser 


Met 


He 


Phe 


Met 


Ala 


He 


x y i 


He 


Leu 


He 


Arg 








O \J 










O £. «J 










8 30 






Phe 


Arg 


Asp 


He 


Ser 


Phe 


Ser 


Ala 


Gly 


Val 


Phe 


Val 


Ser 


Val 


Ala 


Ala 






8 35 










84 0 










845 








Thr 


Thr 


Phe 


Cy s 


He 


He 


Ala 


Leu 


i yi. 


Ala 


Leu 


Leu 


Trn 
iF 


Lys 


He 


Leu 




850 










8 55 










860 










XT JL \J 


IT lit: 


nil 


l it; U 


Glu 


He 


Asp 


bill 


A.s n 


p Vi «=> 

IT lit; 


Tip 

lie 


Ala 


Ala 


He 


Leu 


Ala 


8 65 










870 










875 










880 


lie 


lie 


Gly 




Ser 


Leu 


As n 


As p 


Thr 


Val 


Val 


Val 


Phe 


Asp 


Arg 


He 










885 










890 










895 




Arg 


Glu 


Thr 


Met 


Lys 


Leu 


Tvr 


Pro 


As n 


Arg 


Asp 


Arg 


Tvr 
j 


Gin 


Val 


He 








900 










905 










910 






As n 


Asp 


Ala 


Leu 


As n 


Ser 


Thr 


Leu 


Gly 


Arg 


Thr 


Leu 


As n 


Thr 


Ser 


Leu 






915 










920 










925 








Thr 


Thr 


Phe 


He 


Val 


Met 


Leu 


Val 


He 


Phe 


He 


Phe 


Gly 


Gly Ala 


Thr 




930 










935 










940 










Met 


Arg 


Ser 


Phe 


Thr 


Phe 


Ser 


He 


Leu 


Leu 


Gly 


He 


Val 


He 


Gly 


Thr 


945 










950 










955 










960 


Tyr 


Ser 


Thr 


Leu 


Phe 


Val 


Ala 


Thr 


Pro 


Leu 


Ala 


Tyr 


Glu 


He 


Gin 


Lys 










965 










970 










975 




Arg 


Lys 


Leu 


Asn 


Lys 


Ala 


Ala 


Lys 


Lys 

















980 985 



(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...106

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



Leu Ser Tyr Ser Gly Glu Ser Asp Ala Lys Glu Ser Asp Gln Asn Cys
1 5 10 15

Arg Lys Cys Thr Phe Ile Gly Phe Glu Lys Arg Val Asn Thr Met Arg
20 25 30

Leu Ile Lys Ala Phe Leu Val Gln Leu Leu Leu Leu Pro Ile Phe Phe
35 40 45

Tyr Lys Arg Phe Ile Ser Pro Leu Thr Pro Pro Ser Cys Arg Phe Thr
50 55 60

Pro Ser Cys Ser Ser Tyr Ala Ile Glu Ala Leu Arg Lys Tyr Gly Pro
65 70 75 80

Gly Lys Gly Leu Leu Leu Ser Ile Lys Arg Ile Leu Arg Cys His Pro
85 90 95

Trp Gly Gly Ser Gly Tyr Asp Pro Val Pro
100 105
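
By way of illustration only, the three-letter residue codes used in the amino acid listings may be converted to the one-letter code for use with sequence comparison software. The following is a minimal sketch in Python; the names `THREE_TO_ONE` and `to_one_letter` are hypothetical and are not part of the specification. It assumes that tokens consisting only of digits are position numbers and that every other token is one of the twenty standard three-letter codes.

```python
# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing_text):
    """Convert whitespace-separated three-letter codes to a one-letter string.

    Assumption: purely numeric tokens are position numbers and are skipped.
    An unrecognised token raises KeyError so that a mis-transcribed residue
    is noticed rather than silently dropped.
    """
    letters = []
    for token in listing_text.split():
        if token.isdigit():
            continue  # position number printed beneath the residues
        letters.append(THREE_TO_ONE[token])
    return "".join(letters)


# First sixteen residues of SEQ ID NO: 14 with their position numbers.
row = """Leu Ser Tyr Ser Gly Glu Ser Asp Ala Lys Glu Ser Asp Gln Asn Cys
1 5 10 15"""
print(to_one_letter(row))  # LSYSGESDAKESDQNC
```

The length of the converted string for a complete sequence may be compared with the LENGTH entry of the corresponding SEQUENCE CHARACTERISTICS.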















(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...456



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 



Arg Gly Gly Gln Ile Arg Arg His His Thr Asp Ser Ser Arg Gly Ser
1 5 10 15

Asp Ser Lys Ala Cys Arg Phe Ser Phe His Ala Glu Thr Ile Gly Phe
20 25 30

Ser Asn His Gln Arg Ala Lys Met Asn Tyr Leu Tyr Ile Leu Ile Thr
35 40 45

Leu Leu Leu Ser Gly Phe Phe Ser Gly Ala Glu Ile Ala Phe Leu Ser
50 55 60

Ser Asp Lys Leu Arg Leu Glu Leu Asp Arg Asn Arg Gly Asp Leu Thr
65 70 75 80

Gly Arg Ala Leu Asn Leu Leu Tyr Arg His Pro Asp Gln Leu Val Thr
85 90 95

Thr Leu Leu Val Gly Asn Asn Ile Val Leu Val Val Tyr Gly Leu Leu
100 105 110

Met Ala Gly Leu Leu Ala Ala Pro Leu Ala Gln Trp Ile Asp Asn Asp
115 120 125

Ala Met Ile Val Val Leu Gln Ser Val Leu Ser Thr Ile Ile Ile Leu
130 135 140

Phe Thr Gly Glu Phe Leu Pro Lys Ala Ile Phe Lys Thr Asn Ala Asn
145 150 155 160

Met Met Met Arg Val Phe Ala Leu Pro Ile Val Ala Ile Tyr Tyr Leu
165 170 175

Leu Tyr Pro Leu Ser Lys Leu Phe Thr Gly Leu Ser Arg Ser Phe Ile
180 185 190

Arg Leu Val Asp Lys Asn Tyr Val Pro Thr Thr Val Gly Leu Gly Arg
195 200 205

Val Asp Leu Asp His Tyr Leu Ala Glu Asn Met Ser Gly Glu Asn Glu
210 215 220

Gln Asn Asp Leu Thr Thr Glu Val Lys Ile Ile Gln Asn Ala Leu Asp
225 230 235 240

Phe Ser Gly Ile Gln Val Arg Asp Cys Met Ile Pro Arg Asn Glu Met
245 250 255

Ile Ala Cys Glu Leu Gln Thr Asp Ile Glu Val Leu Lys Thr Thr Phe
260 265 270

Ile Asp Thr Gly Leu Ser Lys Ile Ile Ile Tyr Arg Gln Asn Ile Asp
275 280 285

Asp Val Val Gly Tyr Ile His Ser Ser Glu Met Phe Arg Gly Gln Asp
290 295 300

Trp Gln Lys Arg Ile Asn Thr Thr Val Phe Val Pro Glu Ser Met Tyr
305 310 315 320

Ala Asn Lys Leu Met Arg Leu Leu Met Gln Arg Lys Lys Ser Ile Ala
325 330 335

Ile Val Ile Asp Glu Leu Gly Gly Thr Ala Gly Met Val Thr Leu Glu
340 345 350

Asp Leu Val Glu Glu Ile Phe Gly Asp Ile Glu Asp Glu His Asp Thr
355 360 365

Arg Lys Ile Ile Ala Lys Gln Leu Gly Pro His Thr Tyr Leu Val Ser
370 375 380

Gly Arg Met Glu Ile Asp Asp Val Asn Glu Arg Phe Gly Leu Ser Leu
385 390 395 400

Pro Glu Ser Asp Asp Tyr Leu Thr Val Ala Gly Phe Ile Leu Asn Ser
405 410 415

His Gln Asn Ile Pro Gln Ala Asn Glu Val Val Glu Ile Ala Pro Tyr
420 425 430

Thr Phe Thr Ile Leu Arg Ser Ser Ser Thr Lys Ile Glu Leu Val Lys
435 440 445

Met Ser Ile Asp Asp Gln Ser Asn
450 455



















(2) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 679 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...679

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 



Pro Ile Pro Phe Leu Leu Asn Met Tyr Ser Gly His His Lys Ile His
1 5 10 15

Tyr Pro Phe Leu Ile Leu Leu Val Cys Leu Ala Phe Ala Ala Cys Lys
20 25 30

Ser Val Lys Leu Lys Asp Ala Glu Lys Ala His Asp Arg Gln Glu Tyr
35 40 45

Thr Lys Ala Ala Asp Met Tyr Asn Thr Leu Tyr Arg Arg Thr Arg Arg
50 55 60

Lys Gln Val Glu Met Lys Ala Tyr Thr Ala Phe Arg Ser Gly Glu Asn
65 70 75 80

Tyr Arg Ala Ala Gly Arg Gln Ala Lys Ala Leu Arg Gly Tyr Leu Asn
85 90 95

Ala Arg Arg Tyr Gly Tyr Pro Asp Ser Val Val Leu Leu Arg Leu Ala
100 105 110

Gln Thr Tyr Gln Gln Gly Gly Asn Tyr Lys Glu Ala Glu Val Leu Phe
115 120 125

Arg Gly Tyr Leu Glu Ala Tyr Pro Lys Ser Tyr Phe Ala Ala Ile Gly
130 135 140

Leu Glu Gly Cys Leu Phe Ala Arg Gln Gln Lys Glu Tyr Pro Thr Arg
145 150 155 160

Tyr Arg Ile Arg Arg Ala Ala Glu Trp Asn Ser Ala Arg Gly Asp Phe
165 170 175

Gly Pro Ala Tyr Ala Pro Asp Ala Ser Ala Leu Tyr Phe Thr Ser Ser
180 185 190

Arg Ser Lys Asp Asp Gly Leu Asp Asn Ser Ser Ile Thr Gly Leu Lys
195 200 205

Pro Asn Asp Ile Tyr Ile Ile Lys Arg Asp Ala Gln Gly Arg Trp Gly
210 215 220

Arg Pro Asp Ser Val Ser Gly Gly Ile Asn Thr Pro Trp Asp Glu Gly
225 230 235 240

Val Pro Thr Ile Thr Pro Asp Gly Ser Thr Ile Tyr Tyr Thr Leu Ala
245 250 255

Gln Gln Gly Ala Asp Tyr Asp Arg Thr Val Gln Ile Tyr Ser Ala Ala
260 265 270

Arg Ser Gly Glu Gly Gly Trp Ser Asn Gly Ser Leu Val Asp Ile Met
275 280 285

Arg Asp Ser Leu Arg Met Ala Ala His Pro Ser Met Ser Ala Ser Gly
290 295 300

Asp Tyr Leu Tyr Phe Val Ser Asn Ile Gly Gly Ser Tyr Gly Gly Lys
305 310 315 320

Asp Ile Tyr Arg Val Lys Val Ser Asp Arg Ser Tyr Gly Ser Pro Glu
325 330 335

Asn Leu Gly Pro Asp Ile Asn Thr Pro Gly Asp Glu Met Phe Pro Phe
340 345 350

Ile Asp Gly Asp Ser Thr Leu Phe Phe Ala Ser Asp Gly His Ala Gly
355 360 365

Leu Gly Gly Leu Asp Ile Phe Lys Ala Thr Leu Asp Ser Thr Gly Gln
370 375 380

Trp His Val Val Asn Met Gly Gln Pro Val Asn Ser Ser Ala Asp Asp
385 390 395 400

Phe Gly Leu Ala Val Glu Pro Lys Gly Lys Asn Lys Glu Glu Ala Leu
405 410 415

Pro Asp Asn Gly Val Lys Gly Val Phe Cys Ser Asn Arg Gly Asp Ala
420 425 430

Arg Gly Trp Pro His Leu Phe His Phe Glu Leu Pro Ala Ile Tyr Thr
435 440 445

Glu Ile Gln Gly Tyr Val Met Asp Arg Glu Glu Asn Pro Ile Ala Gly
450 455 460

Ala Thr Val Arg Ile Val Gly Glu Arg Gly Pro Val Gly Gln Gly Phe
465 470 475 480

Val Thr Thr Arg Asp Asp Gly Ser Tyr Lys Met Ser Val Gln Gly Asp
485 490 495

Thr Arg Tyr Val Met Leu Ala Gly Ala Ser Gly Tyr Leu Asn Gln Tyr
500 505 510

Val Glu Leu Lys Thr Asp Thr Ala Lys Gln Ser Glu Thr Tyr Tyr Val
515 520 525

Asp Phe Phe Leu Ala Ser Arg Glu Lys Ala Glu Gly Leu Gln Asn Ile
530 535 540

Phe Tyr Asp Phe Asp Lys Ala Thr Leu Arg Pro Glu Ser Met Lys Ser
545 550 555 560

Leu Asp Glu Leu Ile Arg Ile Leu Thr Asp Asn Pro Asp Ile Arg Ile
565 570 575

Glu Leu Gly Ser His Ala Asp Arg Lys Gly Pro Asp Ala Tyr Asn Leu
580 585 590

Gly Leu Ser Asp Arg Arg Ala Lys Ser Val Val Asp Tyr Leu Thr Ser
595 600 605

Arg Gly Ile Ala Ala Asp Arg Leu Thr Trp Lys Gly Tyr Gly Lys Ser
610 615 620

Val Pro Lys Thr Val Thr Ala Lys Ile Ala Glu Arg His Asp Phe Leu
625 630 635 640

Lys Glu Gly Asp Val Leu Thr Glu Glu Phe Val Ala Pro Leu Thr Glu
645 650 655

Glu Gln Gln Ser Val Cys Asp Gln Leu Asn Arg Arg Thr Glu Phe Arg
660 665 670

Val Ile Glu Glu Glu Leu Arg
675



(2) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...337



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 



Pro Gln Pro Val Gly Leu Lys Glu Ile Thr Ile Lys Pro Met Cys Leu
1 5 10 15

Glu Pro Ile Ile Ala Pro Ile Ser Ser Glu Leu Leu Glu Gln Glu Leu
20 25 30

Thr Ala Asp Arg Phe Leu Arg Met Thr Asn Lys Ala Gly Asn Glu Ile
35 40 45

Tyr Val Phe Thr Ala Glu Glu Ala Pro His Cys Met Lys Glu Val Gly
50 55 60

Arg Leu Arg Glu Glu Ala Phe Arg His Tyr Gly Gly Gly Thr Gly Lys
65 70 75 80

Ala Ile Asp Ile Asp Glu Phe Asp Thr Met Pro Gly Ser Tyr Lys Gln
85 90 95

Leu Ile Val Trp Asp Pro Gln Asn Lys Ala Ile Leu Gly Gly Tyr Arg
100 105 110

Phe Ile Tyr Gly Arg Asp Val Ala Phe Asp Thr Asp Gly Lys Pro Leu
115 120 125

Leu Ala Thr Ala Glu Met Phe Arg Phe Ser Asp Ala Phe Leu His Asp
130 135 140

Tyr Leu Pro Tyr Thr Val Glu Leu Gly Arg Ser Phe Val Ser Leu Gln
145 150 155 160

Tyr Gln Ser Thr Arg Met Gly Thr Lys Ala Ile Phe Val Leu Asp Asn
165 170 175

Leu Trp Asp Gly Ile Gly Ala Leu Thr Val Val Asn Pro Glu Ala Leu
180 185 190

Tyr Phe Tyr Gly Lys Val Thr Met Tyr Lys Asp Tyr Asp Arg Arg Ala
195 200 205

Arg Asn Leu Ile Leu Tyr Phe Leu Arg Lys His Phe Ser Asp Pro Glu
210 215 220

Gly Leu Val Lys Pro Ile His Pro Leu Pro Ile Glu Ile Ser Ala Glu
225 230 235 240

Asp Glu Ala Leu Phe Ser Ser Ser Asp Phe Asp Thr Asn Tyr Lys Thr
245 250 255

Leu Asn Ile Glu Val Arg Lys Leu Gly Ile Asn Ile Pro Pro Leu Val
260 265 270

Ser Ala Tyr Ile Ala Leu Ser Pro Glu Met Arg Val Phe Gly Thr Ala
275 280 285

Val Asn Glu Ser Phe Gly Glu Val Glu Glu Thr Gly Ile Phe Ile Ala
290 295 300

Val Gly Lys Ile Leu Glu Glu Lys Lys Gln Arg His Ile Glu Ser Phe
305 310 315 320

Ile Leu Ser Arg Asn Glu Lys Lys Gly Leu Asp Ser Ser Asn Gly Arg
325 330 335

Ser



(2) INFORMATION FOR SEQ ID NO: 18 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...324 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 



Leu Thr Leu Phe Cys Cys Asn Tyr Phe Lys Gln Met Arg Ala Asn Ile
1 5 10 15

Trp Gln Ile Leu Ser Val Ser Val Leu Phe Phe Phe Gly Thr Ala Ile
20 25 30

Gly Gln Ala Gln Ser Arg Asn Arg Thr Tyr Glu Ala Tyr Val Lys Gln
35 40 45

Tyr Ala Asp Glu Ala Ile Arg Gln Met Ser Arg Tyr Asn Ile Pro Ala
50 55 60

Ser Ile Thr Ile Ala Gln Ala Leu Val Glu Thr Gly Ala Gly Ala Ser
65 70 75 80

Thr Leu Ala Ser Val His Asn Asn His Phe Gly Ile Lys Cys His Lys
85 90 95

Ser Trp Thr Gly Lys Arg Thr Tyr Arg Thr Asp Asp Ala Pro Asn Glu
100 105 110

Cys Phe Arg Ser Tyr Ser Ala Ala Arg Glu Ser Tyr Glu Asp His Ser
115 120 125

Arg Phe Leu Leu Gln Pro Arg Tyr Arg Pro Leu Phe Lys Leu Asp Arg
130 135 140

Glu Asp Tyr Arg Gly Trp Ala Thr Gly Leu Gln Arg Cys Gly Tyr Ala
145 150 155 160

Thr Asn Arg Gly Tyr Ala Asn Leu Leu Ile Lys Met Val Glu Leu Tyr
165 170 175

Glu Leu Tyr Ala Leu Asp Arg Glu Lys Tyr Pro Ser Trp Phe His Lys
180 185 190

Ser Tyr Pro Gly Ser Asn Lys Lys Ser His Gln Thr Thr Lys Gln Lys
195 200 205

Gln Ser Gly Leu Lys His Glu Ala Tyr Phe Ser Tyr Gly Leu Leu Tyr
210 215 220

Ile Ile Ala Lys Gln Gly Asp Thr Phe Asp Ser Leu Ala Glu Glu Phe
225 230 235 240

Asp Met Arg Ala Ser Lys Leu Ala Lys Tyr Asn Asp Ala Pro Val Asp
245 250 255

Phe Pro Ile Glu Lys Gly Asp Val Ile Tyr Leu Glu Lys Lys His Ala
260 265 270

Cys Ser Ile Ser Lys His Thr Gln His Val Val Arg Val Gly Asp Ser
275 280 285

Met His Ser Ile Ser Gln Arg Tyr Gly Ile Arg Met Lys Asn Leu Tyr
290 295 300

Lys Leu Asn Asp Lys Asp Gly Glu Tyr Ile Pro Gln Glu Gly Asp Ile
305 310 315 320

Leu Arg Leu Arg

























(2) INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...195



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



Met Asp Lys Val Ser Tyr Ala Leu Gly Leu Ser Ile Gly Asn Asn Phe
1 5 10 15

Lys Ser Ser Gly Ile Asp Ser Val Val Met Asp Asp Phe Met Gln Gly
20 25 30

Leu Ser Asp Val Leu Glu Glu Lys Ala Pro Gln Leu Ser Tyr Asp Glu
35 40 45

Ala Lys Arg Glu Ile Glu Ala Tyr Phe Met Asp Leu Gln Gln Lys Ala
50 55 60

Val Lys Leu Asn Lys Glu Ala Gly Glu Glu Phe Leu Lys Ile Asn Ala
65 70 75 80

His Lys Glu Gly Val Thr Thr Leu Pro Ser Gly Leu Gln Tyr Glu Val
85 90 95

Ile Lys Met Gly Glu Gly Pro Lys Pro Thr Leu Ser Asp Thr Val Thr
100 105 110

Cys His Tyr His Gly Thr Leu Ile Asn Gly Ile Val Phe Asp Ser Ser
115 120 125

Met Asp Arg Gly Glu Pro Ala Ser Phe Pro Leu Arg Gly Val Ile Ala
130 135 140

Gly Trp Thr Glu Ile Leu Gln Leu Met Pro Val Gly Ser Lys Trp Lys
145 150 155 160

Val Thr Ile Pro Ser Asp Leu Ala Tyr Gly Asp Arg Gly Ala Gly Glu
165 170 175

His Ile Lys Pro Gly Ser Thr Leu Ile Phe Ile Ile Glu Leu Leu Ser
180 185 190

Ile Asn Lys
195



(2) INFORMATION FOR SEQ ID NO: 20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...271



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20 



Met Asn Ser Arg His Leu Thr Ile Thr Ile Ile Ala Gly Leu Ser Leu
1 5 10 15

Phe Val Leu Thr Leu Gly Gly Cys Ser Val Ala Gln Gln Asp Thr Gln
20 25 30

Trp Thr Leu Gly Gly Lys Leu Phe Thr Ser Ala Trp Ile Gln Arg Ser
35 40 45

Ala Glu Tyr Gln Ala Leu Cys Ile Gln Ala Tyr Asn Ile Ala Thr Glu
50 55 60

Arg Val Asp Ala Leu Pro Ala Glu Arg Lys Gln Gly Asp Arg Pro Tyr
65 70 75 80

Ala Ile Val Thr Asp Ile Asp Glu Thr Ile Leu Asp Asn Thr Pro Asn
85 90 95

Ser Val Tyr Gln Ala Leu Arg Gly Lys Asp Tyr Asp Glu Glu Thr Trp
100 105 110

Gly Lys Trp Cys Ala Gln Ala Asp Ala Asp Thr Leu Ala Gly Ala Leu
115 120 125

Ser Phe Phe Leu His Ala Ala Asn Lys Gly Ile Glu Val Phe Tyr Val
130 135 140

Thr Asn Arg Arg Asp Asn Leu Arg Glu Ala Thr Leu Gln Asn Leu Gln
145 150 155 160

Arg Tyr Gly Phe Pro Phe Ala Asp Glu Glu His Leu Leu Thr Thr His
165 170 175

Gly Pro Ser Asp Lys Glu Pro Arg Arg Leu Lys Ile Gln Glu Gln Tyr
180 185 190

Glu Ile Val Leu Leu Ile Gly Asp Asn Leu Gly Asp Phe His His Phe
195 200 205

Phe Asn Thr Lys Glu Glu Ser Gly Arg Lys Gln Ala Leu Gly Leu Thr
210 215 220

Ala Gly Glu Phe Gly Arg His Phe Ile Met Leu Pro Asn Pro Asn Tyr
225 230 235 240

Gly Ser Trp Glu Pro Ala Trp Tyr Gly Gly Lys Tyr Pro Pro Leu Pro
245 250 255

Glu Arg Asp Lys Ala Leu Lys Gln Leu His Ser Gln Asn Ser Arg
260 265 270







(2) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 296 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...296

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 



Met Glu Asn Leu Lys Asn Ile Gln Pro Arg Glu Asp Phe Asn Trp Glu
1 5 10 15

Glu Phe Glu Ala Gly Gly Val His Ala Ala Val Ser Arg Gln Glu Gln
20 25 30

Glu Ala Ala Tyr Asp Lys Thr Leu Asn Thr Ile Lys Glu Lys Glu Val
35 40 45

Val Met Gly Arg Val Thr Ala Ile Asn Lys Arg Glu Val Val Ile Asn
50 55 60

Val Gly Tyr Lys Ser Glu Gly Val Val Pro Ala Thr Glu Phe Arg Tyr
65 70 75 80

Asn Pro Glu Leu Lys Val Gly Asp Glu Val Glu Val Tyr Ile Glu Asn
85 90 95

Gln Glu Asp Lys Lys Gly Gln Leu Val Leu Ser His Arg Lys Ala Arg
100 105 110

Ala Ala Arg Ser Trp Glu Arg Val Asn Glu Ala Leu Glu Lys Asp Glu
115 120 125

Ile Val Lys Gly Tyr Val Lys Cys Arg Thr Lys Gly Gly Met Ile Val
130 135 140

Asp Val Phe Gly Ile Glu Ala Phe Leu Pro Gly Ser Gln Ile Asp Val
145 150 155 160

Arg Pro Ile Arg Asp Tyr Asp Ala Phe Val Glu Lys Thr Met Glu Phe
165 170 175

Lys Ile Val Lys Ile Asn Gln Glu Tyr Lys Asn Val Val Val Ser His
180 185 190

Lys Val Leu Ile Glu Ala Glu Leu Glu Gln Gln Lys Lys Glu Ile Ile
195 200 205

Gly Lys Leu Glu Lys Gly Gln Val Leu Glu Gly Ile Val Lys Asn Ile
210 215 220

Thr Ser Tyr Gly Val Phe Ile Asp Leu Gly Gly Val Asp Gly Leu Ile
225 230 235 240

His Ile Thr Asp Leu Ser Trp Gly Arg Val Ala His Pro Glu Glu Ile
245 250 255

Val Gln Leu Asp Gln Lys Ile Asn Val Val Ile Leu Asp Phe Asp Glu
260 265 270

Asp Arg Lys Arg Ile Ala Leu Gly Leu Lys Gln Leu Met Pro Ser Ser
275 280 285

Leu Gly Cys Ser Arg Gln Arg Ala
290 295



(2) INFORMATION FOR SEQ ID NO: 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...981

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22

Met Gln Asn Lys Gly Phe Val Ile Val Ile Thr Ser Ala Leu Ala Ile
1 5 10 15

Ile Cys Ala Phe Tyr Leu Ser Phe Ser Phe Val Thr Asn Arg Tyr Glu
20 25 30

Lys Lys Ala Lys Ala Met Gly Asp Val Ala Gly Met Ala Tyr Leu Asp
35 40 45

Ser Met Ser Asn Glu Lys Val Trp Phe Gly Tyr Thr Leu Lys Glu Ala
50 55 60

Gln Ala Glh Gln Ile Gly Leu Gly Leu Asp Leu Lys Gly Gly Met Asn
65 70 75 80

Val Ile Leu Lys Leu Asn Ala Ser Asp Leu Leu Arg Asn Leu Ser Asn
85 90 95

Lys Ser Leu Asp Pro Asn Phe Asn Lys Ala Leu Glu Asn Ala Ala Lys
100 105 110

Ser Thr Glu Gln Ser Asp Phe Ile Asp Ile Phe Val Lys Glu Tyr Arg
115 120 125

Lys Leu Asp Pro Asn Gly Arg Leu Ala Val Ile Phe Gly Ser Gly Asp
130 135 140

Leu Arg Asp Gln Ile Thr Ala Lys Ser Thr Asp Ala Asp Val Val Arg
145 150 155 160

Leu Leu Lys Glu Lys Tyr Asn Ser Ala Val Glu Ala Ser Phe Asn Val
165 170 175

Leu Arg Ala Arg Ile Asp Ala Phe Gly Val Val Ala Pro Asn Leu Gln
180 185 190

Arg Leu Glu Gly Gln Gly Arg Ile Leu Val Glu Leu Pro Gly Val Lys
195 200 205

Asp Pro Glu Arg Val Arg Thr Leu Leu Gln Arg Ser Ala Asn Leu Gln
210 215 220

Phe Trp Arg Thr Tyr Lys Phe Glu Glu Val Ser Gly Asp Leu Ile Ala
225 230 235 240

Ala Asn Asp Arg Leu Ser Glu Leu Ala Met Asn Asn Thr Asp Ala Thr
245 250 255

Pro Glu Thr Glu Pro Ala Thr Thr Asp Ser Val Ala Ala Thr Ala Asp
260 265 270

Ser Ala Ala Val Gln Ala Val Ala Asp Ser Ala Thr Val Ala Gln Lys
275 280 285

Glu Ala Lys Asp Ala Thr Arg Lys Asp Ala Leu Phe Ser Leu Leu Thr
290 295 300

Pro Val Asn Arg Gly Gly Ala Val Val Gly Val Ala Arg Arg Ala Asn
305 310 315 320

Met Ala Gln Ile Ser Glu Met Leu Gln Gln Ala His Asp Leu Lys Val
325 330 335

Thr Arg Glu Asp Val Leu Phe Leu Trp Gly Ala Lys Ala Ile Glu Asp
340 345 350

Pro Glu Thr Lys Lys Glu Thr Asp Leu Tyr Glu Leu Tyr Ala Ile Arg
355 360 365

Thr Asn Arg Thr Gly Asp Pro Asp Leu Gly Gly Asp Val Val Thr Ser
370 375 380

Ala Lys Ser Asp Ile Gln Asn Asp Phe Gly Arg Ser Glu Pro Ile Val
385 390 395 400

Ser Met Thr Met Asn Glu Glu Gly Ala Arg Lys Trp Ala Arg Ile Thr
405 410 415

Lys Asp Asn Val Gly Arg Ala Ile Ala Ile Val Leu Asp Gly Val Val
420 425 430

Tyr Ser Ala Pro Asn Val Asn Asp Glu Ile Thr Gly Gly Arg Ser Gln
435 440 445

Ile Ser Gly His Phe Thr Val Glu Glu Ala Gly Asp Leu Ala Asn Val
450 455 460

Leu Asn Ser Gly Lys Met Asp Ala Thr Val Ser Ile Glu Gln Glu Asn
465 470 475 480

Val Ile Gly Pro Thr Leu Gly Ala Glu Ser Ile Lys Ala Gly Phe Leu
485 490 495

Ser Phe Leu Leu Ala Leu Val Ile Leu Met Cys Tyr Met Cys Leu Ala
500 505 510

Tyr Gly Phe Leu Pro Gly Leu Ile Ala Asn Gly Ala Leu Ile Val Asn
515 520 525

Ser Phe Phe Thr Leu Gly Val Leu Ala Ser Phe His Ala Val Leu Thr
530 535 540

Leu Ser Gly Ile Ala Gly Leu Val Leu Thr Leu Gly Met Ala Val Asp
545 550 555 560

Ala Asn Val Leu Ile Phe Glu Arg Ile Lys Glu Glu Leu Arg Ala Gly
565 570 575

Lys Thr Pro Ile Arg Ala Val Thr Asp Gly Tyr Gly Asn Ala Phe Ser
580 585 590

Ala Ile Phe Asp Ser Asn Val Thr Thr Ile Ile Thr Gly Ile Ile Leu
595 600 605

Phe Leu Tyr Gly Thr Gly Pro Ile Arg Gly Phe Ala Thr Thr Leu Ile
610 615 620

Ile Gly Leu Ile Ala Ser Phe Ile Thr Ala Val Phe Leu Thr Arg Ile
625 630 635 640

Val Phe Glu Lys Leu Ala Lys Lys Gly Arg Leu Asp Lys Ile Thr Phe
645 650 655

Thr Thr Ser Ile Thr Arg Asn Leu Leu Val Asn Pro Ser Tyr Asn Ile
660 665 670

Leu Gly Lys Arg Lys Thr Gly Phe Ile Ile Pro Val Ile Ile Ile Val
675 680 685

Leu Gly Leu Ile Ala Ser Phe Thr Ile Gly Leu Asn Arg Gly Ile Glu
690 695 700

Phe Ser Gly Gly Arg Asn Tyr Val Val Lys Phe Asp Gln Pro Val Ser
705 710 715 720

Ser Glu Ala Val Arg Ser Ala Leu Ser Ser Pro Leu Gln Glu Lys Val
725 730 735

Leu Val Thr Ser Ile Gly Thr Glu Gly Thr Glu Val Arg Ile Ser Thr
740 745 750

Asn Tyr Lys Ile Gln Glu Glu Ser Glu Glu Thr Glu Ala Glu Ile Thr
755 760 765

Asp Lys Leu Tyr Gln Ser Leu Lys Gly Phe Tyr Thr Gln Gln Pro Thr
770 775 780

Ala Asp Gln Phe Leu Asp Asn Ile Ile Ser Ser Gln Lys Val Ser Pro
785 790 795 800

Ser Met Ser Ser Asp Ile Thr Arg Gly Ala Ile Trp Ala Val Leu Leu
805 810 815

Ser Met Ile Phe Met Ala Ile Tyr Ile Leu Ile Arg Phe Arg Asp Ile
820 825 830

Ser Phe Ser Ala Gly Val Phe Val Ser Val Ala Ala Thr Thr Phe Cys
835 840 845

Ile Ile Ala Leu Tyr Ala Leu Leu Trp Lys Ile Leu Pro Phe Thr Met
850 855 860

Glu Ile Asp Gln Asn Phe Ile Ala Ala Ile Leu Ala Ile Ile Gly Tyr
865 870 875 880

Ser Leu Asn Asp Thr Val Val Val Phe Asp Arg Ile Arg Glu Thr Met
885 890 895

Lys Leu Tyr Pro Asn Arg Asp Arg Tyr Gln Val Ile Asn Asp Ala Leu
900 905 910

Asn Ser Thr Leu Gly Arg Thr Leu Asn Thr Ser Leu Thr Thr Phe Ile
915 920 925

Val Met Leu Val Ile Phe Ile Phe Gly Gly Ala Thr Met Arg Ser Phe
930 935 940

Thr Phe Ser Ile Leu Leu Gly Ile Val Ile Gly Thr Tyr Ser Thr Leu
945 950 955 960

Phe Val Ala Thr Pro Leu Ala Tyr Glu Ile Gln Lys Arg Lys Leu Asn
965 970 975

Lys Ala Ala Lys Lys
980

(2) INFORMATION FOR SEQ ID NO: 23

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...76

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

Met Arg Leu Ile Lys Ala Phe Leu Val Gln Leu Leu Leu Leu Pro Ile
1 5 10 15

Phe Phe Tyr Lys Arg Phe Ile Ser Pro Leu Thr Pro Pro Ser Cys Arg
20 25 30

Phe Thr Pro Ser Cys Ser Ser Tyr Ala Ile Glu Ala Leu Arg Lys Tyr
35 40 45

Gly Pro Gly Lys Gly Leu Leu Leu Ser Ile Lys Arg Ile Leu Arg Cys
50 55 60

His Pro Trp Gly Gly Ser Gly Tyr Asp Pro Val Pro
65 70 75

(2) INFORMATION FOR SEQ ID NO: 24

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 417 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...417 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 



Met Asn Tyr Leu Tyr Ile Leu Ile Thr Leu Leu Leu Ser Gly Phe Phe
1 5 10 15

Ser Gly Ala Glu Ile Ala Phe Leu Ser Ser Asp Lys Leu Arg Leu Glu
20 25 30

Leu Asp Arg Asn Arg Gly Asp Leu Thr Gly Arg Ala Leu Asn Leu Leu
35 40 45

Tyr Arg His Pro Asp Gln Leu Val Thr Thr Leu Leu Val Gly Asn Asn
50 55 60

Ile Val Leu Val Val Tyr Gly Leu Leu Met Ala Gly Leu Leu Ala Ala
65 70 75 80

Pro Leu Ala Gln Trp Ile Asp Asn Asp Ala Met Ile Val Val Leu Gln
85 90 95

Ser Val Leu Ser Thr Ile Ile Ile Leu Phe Thr Gly Glu Phe Leu Pro
100 105 110

Lys Ala Ile Phe Lys Thr Asn Ala Asn Met Met Met Arg Val Phe Ala
115 120 125

Leu Pro Ile Val Ala Ile Tyr Tyr Leu Leu Tyr Pro Leu Ser Lys Leu
130 135 140

Phe Thr Gly Leu Ser Arg Ser Phe Ile Arg Leu Val Asp Lys Asn Tyr
145 150 155 160

Val Pro Thr Thr Val Gly Leu Gly Arg Val Asp Leu Asp His Tyr Leu
165 170 175

Ala Glu Asn Met Ser Gly Glu Asn Glu Gln Asn Asp Leu Thr Thr Glu
180 185 190

Val Lys Ile Ile Gln Asn Ala Leu Asp Phe Ser Gly Ile Gln Val Arg
195 200 205

Asp Cys Met Ile Pro Arg Asn Glu Met Ile Ala Cys Glu Leu Gln Thr
210 215 220

Asp Ile Glu Val Leu Lys Thr Thr Phe Ile Asp Thr Gly Leu Ser Lys
225 230 235 240

Ile Ile Ile Tyr Arg Gln Asn Ile Asp Asp Val Val Gly Tyr Ile His
245 250 255

Ser Ser Glu Met Phe Arg Gly Gln Asp Trp Gln Lys Arg Ile Asn Thr
260 265 270

Thr Val Phe Val Pro Glu Ser Met Tyr Ala Asn Lys Leu Met Arg Leu
275 280 285

Leu Met Gln Arg Lys Lys Ser Ile Ala Ile Val Ile Asp Glu Leu Gly
290 295 300

Gly Thr Ala Gly Met Val Thr Leu Glu Asp Leu Val Glu Glu Ile Phe
305 310 315 320

Gly Asp Ile Glu Asp Glu His Asp Thr Arg Lys Ile Ile Ala Lys Gln
325 330 335

Leu Gly Pro His Thr Tyr Leu Val Ser Gly Arg Met Glu Ile Asp Asp
340 345 350

Val Asn Glu Arg Phe Gly Leu Ser Leu Pro Glu Ser Asp Asp Tyr Leu
355 360 365

Thr Val Ala Gly Phe Ile Leu Asn Ser His Gln Asn Ile Pro Gln Ala
370 375 380

Asn Glu Val Val Glu Ile Ala Pro Tyr Thr Phe Thr Ile Leu Arg Ser
385 390 395 400

Ser Ser Thr Lys Ile Glu Leu Val Lys Met Ser Ile Asp Asp Gln Ser
405 410 415

Asn



(2) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...672

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
Met Tyr Ser Gly His His Lys Ile His Tyr Pro Phe Leu Ile Leu Leu
1 5 10 15

Val Cys Leu Ala Phe Ala Ala Cys Lys Ser Val Lys Leu Lys Asp Ala
20 25 30

Glu Lys Ala His Asp Arg Gln Glu Tyr Thr Lys Ala Ala Asp Met Tyr
35 40 45

Asn Thr Leu Tyr Arg Arg Thr Arg Arg Lys Gln Val Glu Met Lys Ala
50 55 60

Tyr Thr Ala Phe Arg Ser Gly Glu Asn Tyr Arg Ala Ala Gly Arg Gln
65 70 75 80

Ala Lys Ala Leu Arg Gly Tyr Leu Asn Ala Arg Arg Tyr Gly Tyr Pro
85 90 95

Asp Ser Val Val Leu Leu Arg Leu Ala Gln Thr Tyr Gln Gln Gly Gly
100 105 110

Asn Tyr Lys Glu Ala Glu Val Leu Phe Arg Gly Tyr Leu Glu Ala Tyr
115 120 125

Pro Lys Ser Tyr Phe Ala Ala Ile Gly Leu Glu Gly Cys Leu Phe Ala
130 135 140

Arg Gln Gln Lys Glu Tyr Pro Thr Arg Tyr Arg Ile Arg Arg Ala Ala
145 150 155 160

Glu Trp Asn Ser Ala Arg Gly Asp Phe Gly Pro Ala Tyr Ala Pro Asp
165 170 175

Ala Ser Ala Leu Tyr Phe Thr Ser Ser Arg Ser Lys Asp Asp Gly Leu
180 185 190

Asp Asn Ser Ser Ile Thr Gly Leu Lys Pro Asn Asp Ile Tyr Ile Ile
195 200 205

Lys Arg Asp Ala Gln Gly Arg Trp Gly Arg Pro Asp Ser Val Ser Gly
210 215 220

Gly Ile Asn Thr Pro Trp Asp Glu Gly Val Pro Thr Ile Thr Pro Asp
225 230 235 240

Gly Ser Thr Ile Tyr Tyr Thr Leu Ala Gln Gln Gly Ala Asp Tyr Asp
245 250 255

Arg Thr Val Gln Ile Tyr Ser Ala Ala Arg Ser Gly Glu Gly Gly Trp
260 265 270

Ser Asn Gly Ser Leu Val Asp Ile Met Arg Asp Ser Leu Arg Met Ala
275 280 285

Ala His Pro Ser Met Ser Ala Ser Gly Asp Tyr Leu Tyr Phe Val Ser
290 295 300

Asn Ile Gly Gly Ser Tyr Gly Gly Lys Asp Ile Tyr Arg Val Lys Val
305 310 315 320

Ser Asp Arg Ser Tyr Gly Ser Pro Glu Asn Leu Gly Pro Asp Ile Asn
325 330 335

Thr Pro Gly Asp Glu Met Phe Pro Phe Ile Asp Gly Asp Ser Thr Leu
340 345 350

Phe Phe Ala Ser Asp Gly His Ala Gly Leu Gly Gly Leu Asp Ile Phe
355 360 365

Lys Ala Thr Leu Asp Ser Thr Gly Gln Trp His Val Val Asn Met Gly
370 375 380

Gln Pro Val Asn Ser Ser Ala Asp Asp Phe Gly Leu Ala Val Glu Pro
385 390 395 400

Lys Gly Lys Asn Lys Glu Glu Ala Leu Pro Asp Asn Gly Val Lys Gly
405 410 415

Val Phe Cys Ser Asn Arg Gly Asp Ala Arg Gly Trp Pro His Leu Phe
420 425 430

His Phe Glu Leu Pro Ala Ile Tyr Thr Glu Ile Gln Gly Tyr Val Met
435 440 445

Asp Arg Glu Glu Asn Pro Ile Ala Gly Ala Thr Val Arg Ile Val Gly
450 455 460

Glu Arg Gly Pro Val Gly Gln Gly Phe Val Thr Thr Arg Asp Asp Gly
465 470 475 480

Ser Tyr Lys Met Ser Val Gln Gly Asp Thr Arg Tyr Val Met Leu Ala
485 490 495

Gly Ala Ser Gly Tyr Leu Asn Gln Tyr Val Glu Leu Lys Thr Asp Thr
500 505 510

Ala Lys Gln Ser Glu Thr Tyr Tyr Val Asp Phe Phe Leu Ala Ser Arg
515 520 525

Glu Lys Ala Glu Gly Leu Gln Asn Ile Phe Tyr Asp Phe Asp Lys Ala
530 535 540

Thr Leu Arg Pro Glu Ser Met Lys Ser Leu Asp Glu Leu Ile Arg Ile
545 550 555 560

Leu Thr Asp Asn Pro Asp Ile Arg Ile Glu Leu Gly Ser His Ala Asp
565 570 575

Arg Lys Gly Pro Asp Ala Tyr Asn Leu Gly Leu Ser Asp Arg Arg Ala
580 585 590

Lys Ser Val Val Asp Tyr Leu Thr Ser Arg Gly Ile Ala Ala Asp Arg
595 600 605

Leu Thr Trp Lys Gly Tyr Gly Lys Ser Val Pro Lys Thr Val Thr Ala
610 615 620

Lys Ile Ala Glu Arg His Asp Phe Leu Lys Glu Gly Asp Val Leu Thr
625 630 635 640

Glu Glu Phe Val Ala Pro Leu Thr Glu Glu Gln Gln Ser Val Cys Asp
645 650 655

Gln Leu Asn Arg Arg Thr Glu Phe Arg Val Ile Glu Glu Glu Leu Arg
660 665 670



(2) INFORMATION FOR SEQ ID NO: 26 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...324



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 



Met 


Cys 


Leu 


Glu 


Pro 


He 


He 


Ala 


Pro 


He 


Ser 


Ser 


Glu 


Leu 


Leu 


Glu 


1 






5 










10 










15 




Gin 


Glu 


Leu 


Thr 


Ala 


Asp 


Arg 


Phe 


Leu 


Arg 


Met 


Thr 


Asn 


Lys 


Ala 


Gly 








20 










25 










30 






Asn 


Glu 


He 


Tyr 


Val 


Phe 


Thr 


Ala 


Glu 


Glu 


Ala 


Pro 


His 


Cys 


Met 


Lys 






35 










40 










45 








Glu 


Val 


Gly 


Arg 


Leu 


Arg 


Glu 


Glu 


Ala 


Phe 


Arg 


His 


Tyr 


Gly 


Gly 


Gly 




50 










55 










60 










Thr 


Gly 


Lys 


Ala 


Ile


Asp 


Ile


Asp 


Glu 


Phe 


Asp 


Thr 


Met 


Pro 


Gly 


Ser 


65 










70 










75 










80 


Tyr 


Lys 


Gln


Leu 


Ile


Val 


Trp 


Asp 


Pro 


Gln


Asn 


Lys 


Ala 


Ile


Leu 


Gly 










85 










90 










95 




Gly 


Tyr 


Arg 


Phe 


Ile


Tyr 


Gly 


Arg 


Asp 


Val 


Ala 


Phe 


Asp 


Thr 


Asp 


Gly 








100 










105 










110 






Lys 


Pro 


Leu 


Leu 


Ala 


Thr 


Ala 


Glu 


Met 


Phe 


Arg 


Phe 


Ser 


Asp 


Ala 


Phe 






115 










120 










125 








Leu 


His 


Asp 


Tyr 


Leu 


Pro 


Tyr 


Thr 


Val 


Glu 


Leu 


Gly 


Arg 


Ser 


Phe 


Val 




130 










135 










140 










Ser 


Leu 


Gln


Tyr 


Gln


Ser 


Thr 


Arg 


Met 


Gly 


Thr 


Lys 


Ala 


Ile


Phe 


Val 


145 










150 










155 










160 


Leu 


Asp 


Asn 


Leu 


Trp 


Asp 


Gly 


Ile


Gly 


Ala 


Leu 


Thr 


Val 


Val 


Asn 


Pro 










165 










170 










175 




Glu 


Ala 


Leu 


Tyr 


Phe 


Tyr 


Gly 


Lys 


Val 


Thr 


Met 


Tyr 


Lys 


Asp 


Tyr 


Asp 












180 










185 










190 






Arg


Arg


Ala 


Arg 


Asn 


Leu 


Ile


Leu 


Tyr


Phe 


Leu 


Arg 


Lys 


His 


Phe 


Ser 






195 










200 










205 








Asp 


Pro 


Glu 


Gly


Leu 


Val 


Lys


Pro 


Ile


His 


Pro 


Leu 


Pro 


Ile


Glu 


Ile




210 










215 










220 










Ser 


Ala 


Glu 


Asp


Glu 


Ala 


Leu 


Phe 


Ser 


Ser 


Ser 


Asp 


Phe 


Asp 


Thr 


Asn 


225 










230 










235 










240 


Tyr


Lys 


Thr 


Leu 


Asn 


Ile


Glu 


Val 


Arg 


Lys 


Leu 


Gly


Ile


Asn 


Ile


Pro 










245 










250 










255 




Pro 


Leu 


Val 


Ser 


Ala 


Tyr


Ile


Ala 


Leu 


Ser 


Pro 


Glu 


Met 


Arg 


Val 


Phe 








260 










265 










270 






Gly 


Thr 


Ala 


Val 


Asn 


Glu 


Ser 


Phe 


Gly 


Glu 


Val 


Glu 


Glu 


Thr 


Gly


Ile






275 










280 










285 








Phe 


Ile


Ala 


Val 


Gly 


Lys 


Ile


Leu 


Glu 


Glu 


Lys 


Lys 


Gln


Arg 


His 


Ile




290 










295 










300 










Glu 


Ser 


Phe 


Ile


Leu 


Ser 


Arg 


Asn 


Glu 


Lys 


Lys 


Gly 


Leu 


Asp 


Ser 


Ser 


305 










310 










315 










320 


Asn 


Gly 


Arg 


Ser 



























(2) INFORMATION FOR SEQ ID NO: 27 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...313



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 



Met 


Arg 


Ala 


Asn 


Ile


Trp 


Gln


Ile


Leu 


Ser 


Val 


Ser 


Val 


Leu 


Phe 


Phe 


1 








5 










10 










15 




Phe 


Gly 


Thr 


Ala 


Ile


Gly 


Gln


Ala 


Gln


Ser 


Arg 


Asn 


Arg 


Thr 


Tyr 


Glu 








20 










25 










30 






Ala 


Tyr 


Val 


Lys 


Gln


Tyr 


Ala 


Asp 


Glu 


Ala 


Ile


Arg 


Gln


Met 


Ser 


Arg 






35 










40 










45 








Tyr 


Asn 


Ile


Pro 


Ala 


Ser 


Ile


Thr 


Ile


Ala 


Gln


Ala 


Leu 


Val 


Glu 


Thr 




50 










55 










60 










Gly Ala 


Gly 


Ala 


Ser 


Thr 


Leu 


Ala 


Ser 


Val 


His 


Asn 


Asn 


His 


Phe 


Gly 


65 










70 










75 










80 


Ile


Lys 


Cys 


His 


Lys 


Ser 


Trp 


Thr 


Gly 


Lys 


Arg 


Thr 


Tyr 


Arg 


Thr 


Asp 










85 










90 










95 




Asp 


Ala 


Pro 


Asn 


Glu 


Cys 


Phe 


Arg 


Ser 


Tyr 


Ser 


Ala 


Ala 


Arg 


Glu 


Ser 








100 










105 










110 






Tyr 


Glu 


Asp 


His 


Ser 


Arg 


Phe 


Leu 


Leu 


Gln


Pro 


Arg 


Tyr 


Arg 


Pro 


Leu 






115 










120 










125 








Phe 


Lys 


Leu 


Asp 


Arg 


Glu 


Asp 


Tyr 


Arg 


Gly 


Trp 


Ala 


Thr 


Gly 


Leu 


Gln




130 










135 










140 










Arg 


Cys 


Gly 


Tyr 


Ala 


Thr 


Asn 


Arg 


Gly 


Tyr 


Ala 


Asn 


Leu 


Leu 


Ile


Lys 


145 










150 










155 










160 


Met 


Val 


Glu 


Leu 


Tyr 


Glu 


Leu 


Tyr 


Ala 


Leu 


Asp 


Arg 


Glu 


Lys 


Tyr 


Pro 










165 










170 










175 




Ser 


Trp 


Phe 


His 


Lys 


Ser 


Tyr 


Pro 


Gly 


Ser 


Asn 


Lys 


Lys 


Ser 


His 


Gln



180 185 190 






Thr 


Thr 


Lys 


Gln


Lys 


Gln


Ser 


Gly 






195 










200 


Tyr 


Gly 


Leu 


Leu 


Tyr 


Ile


Ile


Ala 




210 










215 




Leu 


Ala 


Glu 


Glu 


Phe 


Asp 


Met 


Arg 


225 










230 






Asp 


Ala 


Pro 


Val 


Asp 


Phe 


Pro 


Ile










245 








Glu 


Lys 


Lys 


His 


Ala 


Cys 


Ser 


Ile








260 










Arg 


Val 


Gly 


Asp 


Ser 


Met 


His 


Ser 






275 










280 


Met 


Lys 


Asn 


Leu 


Tyr 


Lys 


Leu 


Asn 




290 










295 




Gln


Glu 


Gly 


Asp 


Ile


Leu 


Arg 


Leu 


305 










310 







Leu 


Lys 


His 


Glu 


Ala 


Tyr


Phe 


Ser 










205 








Lys 


Gln


Gly 


Asp 


Thr 


Phe 


Asp


Ser 








220 










Ala 


Ser 


Lys 


Leu 


Ala 


Lys 


Tyr 


Asn 






235 










240 


Glu 


Lys 


Gly 


Asp 


Val 


Ile


Tyr 


Leu 




250 










255 




Ser 


Lys 


His 


Thr 


Gln


His 


Val 


Val 


265 










270 






Ile


Ser 


Gln


Arg 


Tyr 


Gly 


Ile


Arg 










285 








Asp 


Lys 


Asp 


Gly 


Glu 


Tyr 


Ile


Pro 



300 



Arg 



